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Chapter l 
INTRODUCTION 

The publication of "Experimental and Quasi-Experimental 
Designs for Research on Teaching" by D. T. Campbell and 
J. C, Stanley in the Handbook of Research on Teaching (1963) 
represents a major advance in educational- research method- 
ology. Already this chapter has become a much-used refer- 
ence in numerous depaartments of psychology and sociology as 
well as in schools of education. The. Campbell- Stanley chap- 
ter was a presentation of various designs for the assessment 
or comparison of treatment effects with a discussion of 
their strengths and weaknesses. "In general, [the multiple- 
group time— scries design] is an excellent guasi— experimental 
design, perhaps the best of the more feasible designs.... 

The availability of repeated measurements makes [the design] 
particularly appropriate to research in schools" (Campbell 
and Stanley, 1963, p. 227) . To date, appropriate inferen- 
tial statistical analyses of treatment effects in all types 
of time-series experiments have not been developed. Campbell 
and Stanley (1963) and Campbell (1963) have lamented the lack 
of appropriate analytic techniques for important time-series 
quasi- experimental designs. 
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Recent developments in mathematical statistics by Box 
and Jenkins (1962) and Box and Tiao (1965) provide statisti- 
cal models which may very well fill the need identified by 
Campbell and Stanley. Broadly conceived, the objective of 
this project was the investigation of the adequacy of statis- 
tical models developed by Box and Tiao for the analysis of 
time-series quasi-experiments. This investigation involved 
the following: (1) the investigation of the Box-Tiao models 

as to their adequacy as descriptions of time- series experi- 
mental data, (2) investigation of the possibility of exten- 
sion of the models of Box and Tiao to the analysis of more 
general classes of time-series quasi- experiments, (3) the 
development of computer programs for statistical analysis 
based on the models, (4) the application of the models to 
the analysis of actual time-series quasi-eiq>eriments. 

The Time-Series Experiment 

The time- series experiment was identified and discussed 
at length by Campbell and Stanley (1963) in Gage's Handbook 
of Research on Teaching and by Campbell (1963) • This singu- 
larly useful design for estperimental research has long been 
a paradigm for experimentation in the physical sciences. 
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In the social sciences/ and education, a person or group of 
persons might be observed and measured at regular intervals 
prior to the introduction of an experimental treatment (T) • 
Several observations on the group follow the introduction of 
T. An abrupt change in the level of the average score for 
the group between the observation immediately preceding T 
and those following it may indicate a cause and effect rela- 
tionship between T and the variable being measured. Note 
how in the figure below the introduction of T (a new curric- 
ulum perhaps) appears to have increased the "achievement" 

. of the class. 




A multiple-group time-series quasi-e:^eriment might 
involve three treatments (T^, ^ 2 ^ ^ 3 ^^ each applied simul- 
taneously to a different group. One would then compare the 
three gains between times 4 and 5. Another modification of 
the basic time-series design might be called a dependent- 
groups time-series design. In such designs, the same group 
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of persons is involved in multiple time-series designs for 
the evaluation of more than one treatment (T) effect. 

Analysis of Time-Series Experiments 

A crucial problem, identified but not resolved by 
Campbell and Stanley, is that of the decision whether the 
gain in "achievement" between times 4 and 5 should be attri- 
buted to random fluctuations in the time-series curve or to 
an outside influence (presumably T, the new curriculum) • 

This is an inferential statistical problem. Campbell and 
Stanley (1963) considered several significance-testing pro- 
cedures for analysis of the time- series design; in the end, 
each procedure suffered either from implausible assumptions 
or weaknesses such as disregard for most of the data or lack 
of power (i.e,, failure to produce significant results when 
any reasonable observer would attribute an effect to T) . 

A simple correlated jt-test of the difference between 
the pre-treatment and post- treatment observations was judged 
inappropriate by Campbell and Stanley because it would pro- 
duce significant results for nonstationary time-series in 
which no abrupt change occurred. A test for the deviation 
of a value from the regression line derived on all observa- 
tions preceding it (Mood, 1950, pp. 297-298) was considered. 
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The dependence of this test on the unrealistic (for time- 
series analysis) ass\imption of independent observations 
makes it inappropriate. By far the greatest amount of 
research into the problem of analysis of time- series exper- 
iments has been accomplished under the direction of Donald 
T. Campbell at Northwestern University. Much of the work 
of Campbell and his associates has not yet been published. 

Box and Tiao (1965) presented a statistical model for 
the analysis of the change in level of a nonstationary time- 
series. The problem they considered was that of making 
inferences about a possible shift in the level of a time- 
series associated with the occurrence of an event, which 
we have called T. This model (an integrated moving average 
model) is based on statistical assumptions likely to be met 
in many time-series quasi-experiments. The integrated moving 
average model is reported to represent quite well a surpris- 
ingly lasrge number of time— series in economics and industry, 
but it is practically \anknown to behavioral scientists and 
educational researchers. 

The mere existence of statistical models and analysis 
procedures does not guarantee their usefulness for educa- 
tional or psychological research. The economist and chemist 
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have many statistical models that are of no use to behavioral 
scientists. Response surface methodology (Box, 1954) was 
designed (primarily for the chemist) to answer questions 
about optimal combinations of many factors; the method is 
an invaluable tool for the chemist. However, due to the 
generally small ratio of treatment effects to experimental 
error which plagues researchers in the social sciences, they 
rarely use response surface methodology. One focus of the 
research reported here was to determine whether the methods 
of time*” series analysis developed by Box and Tiao suffer 
from the same inability to distinguish treatment effects 
when they are small compared to error. 
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The remainder of this report falls into four chapters 
and two appendices. In Chapter II, the basic model devel- 
oped by Box and Tiao (1965) is applied to actual time-series 
experiment data from two separate experiments, one in psych- 
ology and the other in educational psychology. In Chapter 
III, the same model is applied in a relatively complex 
experimental design to analyze data on traffic fatalities 
in the state of Connecticut before and after a legislative 
crackdown on speeding. A generalization by George C. Tiao 
of the basic model — as yet unpublished — is applied to 
data on the effects of a revision of Germany's divorce laws 
in 1900 on the rate of divorce, in Chapter IV. This genei>- 
alized model incorporates a "drift" parameter which accom- 

I 

modates series which show either a rise or a fall over time. 
In Chapter V a model which is a particular generalization of 
the "drift" model in Chapter IV is presented. The most gen- 
eral model in Chapter V incorporates a parameter' to accoxint 
for an instantaneous change in the direction of the drift of 
a series associated with the introduction of a treatment. 

The general model of Chapter V was also developed for this 
study by Dr. Tiao. In Appendices A and B appear print outs 
of computer programs for the analyses illustrated' in Chap- 
ters IV and V, respectively. 
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Chapter II 



ANALYSIS OF A CLASS OF TIME-SERIES QUASI-EXPERIMEMTS 

WITH THE BOX-TIAO MODEL 

In a recent article. Box and Tlao (1965) developed a method of 
evaluating the change in level between two successive points in time of 
a non-stationary time-series. Observations are taken at equally 
spaced time Intervals and one wishes to make inferences about a possible 
shift In level of the time-series associated with the occurrence of an 
event at a particular point in time. This is precisely the situation 
described by Campbell and Stanley (1963) as a time-series quasi-experl 
mental design. Several observations are taken before and after the 
administration of a treatment^ T, e.g., 0^ 0^ 0^ ^ 0^ 0^ 0^. If there 
is an abrupt shift in the level of a time-series between the third and 
fourth observations, evidence of a treatment effect may exist. Campbell 
and Stanley recognized the shortcomings in the statistical tests they 
suggested as possible analytic techniques (Campbell, 1963; Campbell and 
Stanley, 1963) . The model and statistical techniques developed by Box 
and Tlao (1965) appear to be the most suitable methods now available 
which might have application to the analysis of time-series quasi- 
experiments . 

The statistical model underlying the Box-Tlao analysis .of change 
In level of a time-series is the integrated moving average model. 

t-i 

z^ ■ and z^ ■ L + y 2 '^t-1 ^ *t (3.1a)* 
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for the xi^ observations prior to the introduction of T, and 



*£quatlon numbers are the same as those in Box and Tlao (1965) • 
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for the £2 = N - observations following T, where: 

is the value of the variable observed at time 

L is a fixed but unknown location parameter, 

Y* is a parameter descriptive of the degree of 

interdependence of the observations in the time-series and 
takes values 0 < v < 2, 

is a random normal deviate with mean 0 and variance o’ . 

5 is the change in level of the time-series caused by T. 

Essentially the model implies that the system is subjected to periodic 
random shocks (pt^) a proportion (y) of which are absorbed into the level of the 
series. Data which conform to the model in (3.1a) evidence the following 
properties (among others): 

1. The graph of the time-series follows an erratic, 
somewhat random path with slight, but no systematic 
drifts, trends, or cycles. 

2. The correlogram (i.e., the graph of the auto- 
correlations) of the observations, z^, does not 
I'die out," (i.e., does not tend systematically 
toward zero as the lag between values correlated 
increases) nor does it show cycles characteristic 
of cyclic time-series. 

3. For the N-1 differences between adjacent 

observations, z^ - the lag 1 correlation 

equals (y* - 1)/(1 + (y • 1)^1 higher lag 

correlations equal zero. 



the model can 



t **2 



By setting and y^. « Zj. - r ^2^(1 “ r)^ ^t-l-j» 

be written as Y « X 6 e where X is defined as an N x 2 matrix of weights as 
follows: V 



xJ 



1 (1 - r) ... (1 - I (1 - r)"^ . . . (i - r) 



N-l 



• f • 



I 1 (1 - r). . . a - r)"2-i 






6 is a 2 X 1 vector containing as elements L and & and e is an N x 1 vector of 
random normal deviates, ... Ofjj) , the elements of which have mean 0 

O 

and variance o*^. 

I^en X known, simple least squares estimates of L and 6 can be 
found from the familiar solution to the least-squares normal equations: 






✓S 

L 

V J 



(X^X) * ^ x’^Y 



(3.4) 



Box and Tiao showed that the least squares estimate of 6, namely, &, 
has a £ distribution with N - 2 ^ when divided by an appropriate estimate of its 
standard error. 

When X unknown (as will generally be true) a Bayesian analysis using 
sample information about y is used in making inferences about 5* The posterior 
distribution, h(y)z), of y given a set of N observations and assuming a uniform 
prior distribution is known to within a constant of proportionality. 



h(y|z) oc. 



Y(2-r) 



-V[l-(l-y)2njL] [i-(i-y)2n2] 



(s^) 



where s^ is the residual variance and is given by 

s2.^(T -epfx^xe), 

N-2 

for a given value of 



(5.8) 



(3.18) 
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Computing Procedures 



The present program performs the following operations: 

1. The correlograms (autocorrelations for lag 1 through lag 3n]^/4 or 3n2/4) 

are calculated for and for z^ - separately for pretreatment and 

posttreatment data. 

2. The posterior distribution, h(y)z), of f given the data z for a uniform :;rlor 
distribution is claculated and plotted. The value of hCyf z) is found for 
200 values of y from .01 to 2.00 in steps of .01. For each of these 200 
values of Y*, 

3. is calculated, 

4. The variance error of 'd is calculated 

5. The t-statistic equal to the ratio of ^ to its standard error is calculated 
and plotted on a graph which is superimposed on the graph of h(Y*|z). 

Formation of the matrix of weights X in (3.3) raises problems with 

•• 

"underflow." As y approaches 1.0, successive elements of X get very small. 
Consequently, when ) Xj.]^) < 1.0 x 10*^^, the effects of the weights can be 
considered negligible and each subsequent value of x^^ is set equal to zero for 
that Y*. The same problem and treatment apply to x^ 2 * 

The vector^ is formed by equation (3.4.) and the residual variance s^ 
calculated from formula (3.18). The standard error of ^ is given by 

(2-r^ a>-2) ^ 

The t value for testing the significance of the difference of S from 
0 is given by « ^/a(6lY‘) » 
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Special problems of computer accuracy arise in the calculation of the 
posterior distribution of y When s^ is large, the posterior distribution can 
be very small, since s*^W~2) ^^3 3 multiplicative factor in formula (5*8) • To 

P 

prevent any problems associated with this calculation, all factors are transformed 
to logj ^Q prior to the calculation of the ordinates of the posterior distribution, 
which are then transformed by subtracting the largest value thus obtained. Thus, 
all values of the posterior distribution are divided by the maximum value. With 
one slight exception, antilogs are taken to restore the values to their original 
form. The exception is that when the log of a given value differs from the log 
of the maximum value by more than 33 {i.e., given value / maximum value <10 ), 

the value of the ordinate corresponding to the given' value is set equal to zero. 

Each of the values of^, s^, is stored for each 

value of x» values of the posterior distribution of y are rescaled by' 

fitting trapezoids so that the curve has unit area.) 



Illustrative Analysis 

An illustrative analysis will be performed on data from an experiment 
by Deese and Carpenter which was adapted for presentation in Brown (1961, pp. 118 
119). Two groups of rats were given 24 training trials in running a short alley 
for food. Group A had been fed wet mash for one hour prior to the experiment; 
group B had not eaten for 22 hours. After 24 trials the conditions were 
reversed, group A being deprived of food for 22 hours and group B being fed for 
one hour prior to a final eight trials. Observations were made of the length of 
time between start of a trial and a running response for each rat. Observations 
were converted to logarithms of this latency period for each rat which were then 
averaged and divided into 1 for both groups. The reciprocals, of the average log 
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latencies for groups A and B.over 32 trials appear in Figure 1* Data for Group 
B (high drive followed by low drive state) appear as the solid line; the broken 
line is for Group A. 

Figure 1 about here 



The significance of the effect of shifting from a low drive state to a 
high drive state (Group A) is apparent and would not be enhanced by further 
statistical analysis. However, the significance of the slight downward shift 
between the 24th and 25th trials for Group B is worthy of further investigation. 

Assuming that the fundamental time-series process generating the series 
is the same for both Groups A and B, the n^^ “ 24 pretreatment observations for 
both groups should provide a reasonably good, though somewhat unstable, estimate 
of the correlograms for raw data, z^, and differences, z^ - *t-l* H 2 “ ® 

posttreatment observations are insufficient in number to add any substantial 
information concerning the fit of the model. Even the small number of pretreat- 
ment observations probably represents fewer than a minimal number from which one 
may draw inferences about the fit of the model with any confidence. The correlo- 
grams for Groups A and B for the data in Figure 1 were calculated. To conserve 
space these correlograms are not reproduced here. These two sets of autocorrela- 
tions evidenced neither cycles nor systematic dampening effects characteristic of 
time series of types other than moving average series. On the basis of inspec- 
tion of Figure 1 and the correlograms of the original data. we can continue to 
entertain the model of equations (3.1a) and (3.1b). 

The adequacy of the model is further investigated by observing the 
correlogram of the differences between adjacent observations in a series, i.e., 




the correlogram of ~ ^t* •••> N-l,The correlograms were calculated 

for the pretreatment data in Figure 1. The lag 1 through lag 10 autocorrelations 
of the differences between successive observations appear in Figure 2. 



Figure 2 about here 



The correlogram for z^ - z^.j^ where z^ conforms to the model in (3.1a) 

should show a lag 1 correlation of (y - 1) (1 + (y - 1)^] and lag k autocorrela- 

tions of zero (k > 1). For sample data, an approximation to the standard error 
of an autocorrelation coefficient due to Bartlett (1946) is available. The 
slanted, straight lines at the top and bottom of Figure 2 mark off a distance of 

two standard errors of the autocorrelation coefficient of lag k, 2cr . Note 

"" ‘k 

that only one (lag 9 - Group B) of the 18 autocorrelation coefficients in Figure 

2 lies in a region of rejection that the population value of an autocorrelation 

of lag greater than 1 is zero. 

As will be seen later, the maximum likelihood estimates of y are 0 and 
.25 for Groups A and B, respectively. The lag 1 autocorrelation for group A is 
almost equal to the expected (on the basis of the model) value of (y - 1)/[1 
(y - 1)^] a» -.50. If the model fits exactly, one would expect a lag 1 auto- 
correlation for Group B of (.25 - 1)/[1 + (-.75)^] » -.48. The obtained value 
of -.28 does not differ significantly from this expected value. Acknowledging 
the limited power of these statistical tests and the fact that to accept uncriti- 
cally a model on the basis of so few observations is largely a matter of faith, 
we proceed with the analysis of the data for Group B assuming that the model of 
(3 *13) and (3.1b) holds. 
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In Figure 3 appear the following: (a) the posterior distribution * 

of Y> which indicates the likely values which y might assume for these 
data (the maximum likelihood estimate of y is found by noting that value 
of Y under the peak of the curve) ; (b) the ^-statistic used in testing 
the hypothesis that 6, the shift in level of the series associated with 
the events between trials 24 and 23, is equal to zero; (c) the values 
(dotted lines) of ^ needed for significance at the .15, .10 and .05 
levels in testing the hypothesis 6 » 0 against the hypothesis 

£ < 0. Note that if Y * can be rejected at the .05 level 

in favor of If y is *50 or above (which appears relatively unlikely) 

cannot be rejected at the .15 level. Our Impression is that the data 
do indicate a statistically significant shift downward after the 24th 
trial for Group B. 

The data in Figure 4 are the number of times in a 50-minute period 
in school that a four-year-old hyperactive child changed activities over 
28 consecutive days. (Allen at al . » 1967) For the last seven days, 
the child was given verbal, social. reinforcement for attending to a 
single activity for more than a minute. The question: Is there a 

decrease in activity changes associated with introduction of reinforce- 
ment of attending behavior between days 21 and 22? 

The decrease of approximately 25 activities in 50 minutes from day 
21 to day 22 is less than the "natural” decrease of the time-series 
between days 3 and 4 and days 14 and 15. Can the activity of the time- 
series over the last seven days be viewed as the regular progression of 
the time-series over the first 21 days? An inferential statistical 

analysis will illuminate these data. 
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The data in Figure 4 show neither upward nor downward trends 
cycles. The lag 1 autocorrelation coefficient for the differences 
between successive pre-treatment observations is -.5, which corresponds 
closely to the expected value when y ^ 0 (which is the maximum likeli- 
hood estimate of y for these data) . The other autocorrelations for the 
differences are not significantly different from zero. Thus the data 
in Figure 4 appear to conform to the model. 

As was pointed out earlier, the objective of the analysis of a pos- 
sible change in level of a time-series of the integrated moving average 
type is to obtain a least-squares estimate of 6 and a distributional 
statement about the estimate. Provided y is known, this objective is 
relatively easily attained. The least-squares estimate of 6, namely 6, 
and an estimate of its standard error were given in Box and Tiao in their 
1965 paper. Under suitable assumptions of normality and independence of 
errors, (6 - 6)/o(6) has a ^-distribution with n^ + n^ - 2 degrees of 
freedom. A test of the hypothesis that 6 0 can be carried out with 

the test statistic « 6/o(6). 

However, if y is not known (as is generally true), it is necessary 
to obtain information from the sample of N observations about probable 
values for y. Either of two strategies might be followed; 1) find the 
maximum likelihood estimate of y and estimate 6 and a (6) using only that 
maximum likelihood estimate; 2) plot both the likelihood distribution of 
Y and the value of _t ** 6/a (6) against the value of y as it ranges between 
0 and 2 and see if the t^statistic is clearly significant or non-signi 
ficant over the range of probable values for y. [The likelihood distri- 
bution of y given the N sample observations is given in Box and Tiao 
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(1965) J Both strategies will be employed at different points in the analy- 
ses to follow. 

Of course » y is unknown. The likelihood distribution of y given 
the data is plotted as h(Y|z) in Figure 5. The chances are practically 
nil that y is above .10; the maximum likelihood estimate of y is zero. 

Over the range of likely values of y> 0 to .10, the value of Jl for 
testing the hypothesis that 5 ■ 0 is clearly statistically significant 
is never greater than -4.4). Without much question, then, the intro- 
duction cf the treatment at day 21 worked an effect upon the rate of 
activity change. 
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Figure 1. Starting speeds for two groups of rats be for 
and after a reversal in deprivation schedule 
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FIG. 4. NUMBER OF ACTIVITY CHANGES IN A 50-MINUTE PERIOD BY S OVER 

28 DAYS. SOCIAL REINFORCEMENT GIVEN FOR ATTENDING BEHAVIOR 
EXCEEDING ONE MINUTE OVER THE LAST SEVEN DAYS. 
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Chapter III 
ANALYSIS OP DATA 

ON THE CONNECTICUT SPEEDING CRACKDOWN 
AS A TIME-SERIES QUASI-EXPERIMENT 

In late 1955 in Connecticut, the number of fatalities per 100,000 
population in motor vehicle accidents reached a record high for the 
1950 *s. On December 23, 1955, Governor Abraham Ribicoff took unpre- 
cedented legislative action to reduce traffic fatalities. Ribicoff 
announced that persons convicted of speeding v/ould have their li- 
censes suspended for thirty days at the first offense, for sixty 
days at the second offense, and for an indefinite period (subject 
to a hearing after ninety days) at the third offense. Data on traf- 
fic fatalities before and after the Connecticut crackdown on speed- 
ing can be regarded as a time-series quasi-experiment (Campbell and 
Stanley, 1963; Campbell, 1963) with some significance for the social 
sciences. When supplemented with traffic fatality data for the states 
of Massachusetts, Rhode Island, New York, and New Jersey,' the col- 
lection of observations can be viewed as a multiple-group time-series 
experiment (Campbell and Stanley, 1963; Campbell, 1963). The mul- 
tiple-group time-series design can be diagrammed as follows: 



ipi, II WIIHPIII, I. .11 I. ■pitiuil WilWIWlJi 

. ^ - ^ 



1 



Time 



• • • “ 1 



n. 



+ 1 



*J“ 2 • • • 



Conn.: ...o 0 
Mass.: ...o . 0 
R.X • : • • • 0 0 
N.Y. : ...0 0 
N.J. : ,,,o 0 



T 0 0. . 

0 0 .. 

0 0 .. 

0 0 .. 

0 0 .. 



The O's represent monthly observations of traffic fatalities 
for the n^ months prior to T# the treatment, and for the n^ observa- 
tions following T. The treatment, T, is the Governor's crackdown on 
speeding in the state of Connecticut. No comparable alteration of 
the legislation of the four "control" states took place. 

Evidence of the effectiveness of the Connecticut crackdown* on 
speeding can be gained by comparing the path of the post-T observa- 
tions of Connecticut with those of the four control states. A sharp 
drop in fatalities in Connecticut following T in the absence of simi- 
lar drops in the control states is compelling evidence of the effect- 
ivehess of the crackdown on speeding. 

The problem of measuring the abrupt change in level of a time- 
series and making statistical inferential statements about it Is the 
problem with which the remainder of this report is concerned. 



Analysis of Data 

The Underlying Model 

The statistical model upon which analysis of the Connecticut 

speeding data is based was developed by Box and Tiao (1965) . Box 

and Tiao presented an analytic technique for estimating and making 
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inferences about the change in level of a non-st at ionary time-series. 
The model upon v^hich the analysis is based is a restrictive one; how- 
ever / many sets of data can be manipulated or transformed into special 
indices in such a way that the assumptions of the model will be large- 
ly met. The statistical model here employed is a special case of the 

integrated moving average process (Box and Jenkins, 1962): 

t-1 

2 ^ “ L *(* Y ^ ^ ^ “ 1/ ... / n . (i) 

j=i ^ 

L is a “location parameter" descriptive of the over-all general 
level of the series, 

Y is a parameter which depends upon the interdependency of the 
observations in the time-series, and 

is an observation of a random normal variable with mean 0 
and variance . 

Formula (1) describes the n^^ observations taken prior to the 
introduction of a treatment, e.g., the Connecticut crackdown on speed- 
ing. The ri 2 observations following the introduction of the treatment 
into the time-series differ from (1) only in that a treatment effect, 

6, is present. 



t-1 

= L +Y 2 a^+6 , t = n,, . . . / nQ^ + n 2 . (2) 

j=l 

The parameter 6 is the increment or decrement in the level of 
the time-series due to the introduction of the treatment. The treat- 
ment is assumed to work an immediate and constant effect, 6 , upon the 
time-series. 

The fundamental time-series model regards the system as being 

subjected to periodic random shocks, the a^, (which have zero mean) . 

Furthermore, a proportion, Y# of each shock is assumed to remain in 

the system to influence the movement of the system through time. 
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Hence/ the effect of some extraneous/ random influence on the system 
is not immediately dissipated but continues to v/ork a lessened in- 
fluence on sxibsequent observations. In a sense/ the model describes 
the path of a point taking a random walk which is imbedded in "noise." 

If the value of Y is known/ least-squares estimates of L, which 
is generally of no interest/ and 6, which is of primary interest/ can 
be readily obtained (see fomulas 3.7 and 3*8 in Box and Tiao/ 1965). 
However/ one is unlikely to know the value of y • Not knowing y * one 
may use sample information to determine probable values for Y . Box 

and Tiao (1965) presented a Bayesian approach to obtaining informa- 

\ . 

tion about the value of Y. When a uniform prior distribution is set 

on y, the Bayesian analysis is equivalent to inspecting the likelihood 

* 

function of y. The likelihood of y as a function of the sample data 
is given in equation (5.8) in Box and Tiao (1965) . 

The analytic strategy which the Box-Tiao procedure leads to in 
most instances is one of calculating the likelihood- distribution of 
Y from the data/ finding an estimate of 5 and its standard error for 
all values of Y from 0 to 2.0/ and setting confidence inte3r(;als around 
the estimate of 6 or testing the significance of the difference of 
the estimate of 6 from zero for the likely values of y • 

However/ before such an analysis of data may proceed/ an effort 
must be made to check the appropriateness of the model in (1) and (2) 
for the data in hand. This can be done in large part by inspecting 
the graph of the time-series and of the correlograms of the data. 

Data which conform to the model in (1) and (2) will have the follow- 
ing properties: 
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1. There will be an absence of cycles in the original data, i.e 

the Hi + n 2 observations The data will appear to fluc- 

tuate around a constant elevation, L, with only minor or 
momentary drifts away from this baseline. In other words, 
sustained "drifts" from a baseline in one direction probably 
indicate a violation of the model, viz., probably has a 
non-zero mean. 

2. The correlogram of the original data, the z^., is free of 

cycles and shows a random fluctuation around a baseline. The 
correlogram does not show the familiar damped cyclic curve 
characteristic of autoregressive time-series. (In a time- 
series quasi-experiment it is necessary to calculate correlc 
grams separately for pre-treatment and post-treatment obser- 
vations, since a large treatment effect will produce strong 
lag correlations.) 

3. The correlogram of the differences between successive obser- 
vations in the time series, i.e., the correlogram for 

(t = 2, ... / + n. 2 ) has a lag 1 correlation 

which is larga in absolute value when Y deviates from 1.0 
and all higher lag correlations are near zero. In fact, 

cov(z^ - ~ cov(z^ - 

^t+k ~ ^t+k-1^ * ® k > 1. The associated lag 1 auto- 

- (1 -Y ) 

correlation of the differences is ■ — 

1 + (1 - y )2 

For example, if y =1, all lag correlations of the differ- 
ences between successive values are expected to be zero. 
Fortunately, approximate hypothesis tests are available for 
testing the significance of the lag correlations (Bartlett, 
1946) . 

Investigation of the Fit of the Model to the Data 

The basic data were traffic fatalities for the 60 months prior 
to the Connecticut speeding crackdown in January 1956 and for the 
subsequent 40 months for Connecticut, Massachusetts, Rhode Island 
New York and New Jersey. As the first step in the investigation of 
the fit of the integrated moving average model to these data, each 
monthly fatalities count was divided by the number of miles driven 
in the state during that month. The transformed raw data thus became 
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"monthly fatalities per 100,000,000 miles driven" for all five states. 
Such a transformation would effectively eliminate any upward linear 
trend in the data [no such trend may appear in the integrated moving 
average process in equation (1)] due to increases in population, 
number of drivers, number of cars, etc. 

Inspection of the plot of "monthly fatalities per 100,000,000 
driver miles" showed marked yearly cycles, as one might expect. The 
"peaks" of the cycles coincided with the winter months (Dec. - Feb.) 
the "valleys" occurred during the summer. Such cycles are a clear 
violation of the assumptions of the integrated moving average model. 
The correlogram for "fatalities/100,000,000 miles" for Connecticut 
showed the "damped sine curve" with a period of 12 months which is 
characteristic of data possessing yearly cycles . (The manner in 
which the cycles were removed from the data will be discussed later.) 
It will be instructive for the moment to observe the "monthly fatal- 
ities per 100,000,000 driver miles" with the cycles left in. These 
data appear in Figure 1. 

It can be seen in Figure 1 that the fatalities per 100,000,000 

% 

driver miles reached the highest point in the period 1951-1955 in 
December, 1955. To the extent that this "emergency" prompted 
Ribicoff's decision to crack down on speeding in late December, 1955, 
the decline immediately following the crackdown can be partly inter- 
preted as the natural tendency of observations chosen for their ex- 
tremity to regress toward a central value. 

There is a marked decrease in fatalities per 100,000,000 driver 
miles from December, .1955, to January, 1956. However, there are also 
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decreases in fatalities/100,000,000 mi. in six of the eight possible 
comparisons of a December with the immediately following January. In 

i 

fact, the drop in fatalities/100,000,000 miles from December, 1957, 

« 

i to January, 1958, is almost equal to the drop from December, 1955, to 

i • 

I 

January, 1956. h natural drop from any December to the immediately 
following January in fatalities/100,000,000 miles is quite apparent 
in Figure 1. Such cycles are also obvious in the graphs of monthly 
fatalities per 100,000,000 miles in the four "control" states. 

The following technique was employed to remove the cycles from 

I I 

the data. Since the cycle had a period of twelve months, the average 
fatalities/100,000,000 miles for each of the nine Januaries (1951* 
1959) was subtracted from each January observation. Similarly, obser* 
vations on each of the other eleven months were deviated around the 
average (over nine monthly values) fatalities/100,000,000 miles for 
that month. This was done for each of the five states. (A constant, 

2 or 3/ was then added to these transformed scores to make them all 
positive.) 

These transformed data showed neither apparent cycles nor upward 
or downward trends. The data appear in Figures 2*6. In this form, 
the data appear to satisfy the first condition of the integrated 
moving average model in equation (1) . The next step in the examine* 
tion of the fit of the integrated moving average model to the data 
involves the correlograms of the transformed observations and the 
differences between adjacent observations in the series. 

Correlograms v;ere calculated on the data in Figures 2*6 for pre* 
January 1955 (nj^=60) and post*January 1955 (n2*48) data separately. 
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(A marked change in level of a time-series due to a treatment effect 
would alter the autocorrelations from what they would be in the fun- 
damental process which generates the observations in the time-series 
hence, in judging the fit of a model to data from a time-series ex- 
periment, correlograms must be calculated separately for pre- and 
post- treatment observations.) To conserve space, these correlograms 
are not reproduced here. None of them showed the “damped sine curve" 
characteristic of autoregressive series. Indeed, each correlogram 
appeared to be no more than a random array of non-significant auto- 
correlations characteristic of the correlogram to be expected from 
data conforming to the integrated moving average model. 

The next step in the investigation of the fit of the model in 
(1) to the data is to calculate the correlogram for the differences 
between adjacent observations, It is necessary to calcu- 

late these 'differences separately for the pre-treatment and post- 
treatment data. Only the correlograms for the 60 pre-treatment ob- 
servations for each state are examined here. As was pointed out earl- 
ier, if the model in (1) is satisfied, the lag 1 autocorrelation of 

2 

the differences will equal -(1 -Y )/[l + (1 - Y ) J, where 

Yis an unknown parameter in the model, and the lag 2 and greater 
autocorrelations of the same data will equal zero. Not knowing y# 
it is necessary to obtain an estimate of it. Later it will be seen 
how the likelihood distribution of ^ can be found from the N observa- 
tions, z^, and the maximum likelihood estimate of y found therefrom. 
The maximum likelihood estimates for each of the five states were 
found to be the following: 
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Corresponding Expected 



State 


Maximum Likelihood 
Estimate of Y 


Lag 1 Correlation 

z - z 
t t-1 


Connecticut 


.01 


o 

in 

• 

' 


Massachusetts 


.01 


1 

• 

Ul 

o 


Bhode Island 


.01 


-.50 


New York 


.16 


-.49 


New Jersey 


.11 


-.49 



In light of the above data, the correlograms for the 59 observa- 
tions of for each state should present a lag 1 correlation 

I 

of approximately -.5 and lag 2 and greater correlations which differ 
insignificantly from zero. The first such correlogram— for the Conn- 
ecticut data— appears in Figure 7. The jagged line in Figure 7 is 
the plot of the lag 1 thru lag 30 autocorrelations for the 59 pre- 
treatment observations z^ - for Connecticut. The lag 1 autocor- 

relation of -.555 agrees quite closely with the expected value of 
-.50. Superimposed upon the graph of the correlogram are two curved 
lines indicating those points which lie two standard deviations from 
the mean, zero, in the distribution of the lag k autocorrelation co- 
efficient for samples of size 59 from a population in which the co- 
efficient is zero (see Bartlett, 1946) . Only the lag 1 autocorrela- 
tion coefficient is significantly different from zero in Figure 7; 
hence, the conditions of the model— as reflected in the correlogram of 
2 ^ - — appear to be met by the Connecticut data. 

The correlograms (lag 1 through lag 20) for Bhode Island, Massa- 
chusetts, New York, and New Jersey for the 59 pre-treatment 
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appear in Figure 8. None of the lag 1 auto- 



observations - 2^.]^ 
correlation coefficients differs appreciably from the expected values 
of -.50 and -.49. The lag 2 and greater autocorrelations are distri- 
buted around zero with only three coefficients (viz., lags 18 and 19 
for Massachusetts and lag 4 for New Jersey) lying further than two 
standard errors from zero. (The curved lines marking off two stan- 
dard errors in the distribution of the autocorrelation coefficients 
which appear in Figure 7 can be applied to the data in Figure 8 as 
well.) The total import of the data in both Figures 7 and 8 is that 
the conditions of the integrated moving average model in equation (1) 
which are reflected in the correlograms of "^t-1 reasonably 

satisfied by the data for the five states. 

After transformation of the data and removal of cycles, the data 
on fatalities for the five states appear to satisfy all of the condi- 
tions of the integrated moving average model in equation (1) reason- 
ably well. We shall proceed with the analyses assuming the data are 
adequately described by such a model. 

Analysis for Change in Level of the Five Time-Series 

First, we shall consider in turn the individual analyses for 
changes in level between the 60th and 61st months of the five time- 
series in Figures 2-6. The analysis of the Connecticut data (Figure 
2) will be considered in detail. Summaries of the analyses will be 
presented for the other four states. After consideration of the in- 
dividual analyses, the five sources of data will be combined into a 
single analysis comparing Connecticut with the "control states." 
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i\s v;as pointed out earlier, the objective of the analysis of a possibly 



change in level of a time-series of the integrated moving average type is 



to obtain a least-squares estimate of 6 in ( 1 ) and a distributional state- 



ment about the estimate. Provided Y is known, this objective is relatively 
easily attained. The least-squares estimate of,s, namely g, and an esti- 
mate of its standard error, 5 ( 8 )/ are given in Box and Tiao (1965) by for- 



mulas (3.8) and (3.11), respectively. Under suitable assumptions of normal- 
ity and independence of errors, ( 8-6 )/a(S) has a ^-distribution with n^^ + 
1 I 2 “* 2 degrees of freedom. A test of the hypothesis that 6*0 can be 
carried out with the test statistic ^ * 8 /®(^) . 



However, if y is not known (as is generally true) , it is necessary to 
obtain information from the sample of N observations about probable values 
for Y • Either of two strategies might be followed: 1) find the maximum 
likelihood estimate of Y and estimate 6 and ^ ( 6 ) using only that maximum 



likelihood estimate; 2 ) plot both the likelihood distribution of y and the 

' 

1 

j AAA 

value of ^ * 6 / 0 ( 6 ) against the value of y as it ranges between 0 and 2 and 



see if the Jb-statistic is clearly significant or non-significant over the 



j range of probable values for y . [The likelihood distribution of y given the 

1 . . ■- ' ■ 

I N sample observations is found from formula (5.8) in Box and Tiao (1965) .] 

i • • 

! ’ i 

I Both strategies will be employed at different points in the analyses to fol- 

j 

! 1 . . 

j low.* 

* ' -i 

i 

1 

j A. Analysis for Change in Level of the Connecticut Data (Figure 2) . 



The n,]^ = 60 observations preceding the crackdown on speeding in Connec- 



ticut and the n 2 = 48 post-crackdown observations were subjected to the an- 



alysis outlined in Box and Tiao (1965) for unknown y. The likelihood 



^Calculations performed on computer program described in Chapter II 
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distribution of y given the 108 observations is denoted by h(y| z) in 
Figure 9. The area under the curve h(Y|z) is one unit. The maximum like- 
lihood estimate of y is seen to be 0. The curve denoted by ;t in Figure 9 
is the value of 6 / 0 ( 8 )— read off the right ordinate in the figure — for 
each value of y from 0 to 2 . 

As can be seen by inspection of the two curves in Figure 9 almost all 

the mass of the likelihood distribution of y lies between 0 and 0.25, the 

former being the maximum likelihood estimate and the latter being quite un- 
likely; over this range (0 to 0-25) the value of the ^-statistic for test- 
ing the hypothesis that 6=0 (against 6 < 0) ranges from rO-86 to 

-2.05 (from nonsignificance at the .15 level to significance at the -05 

.level). Note also that ^ is significant at the .05 level with a one-tailed 
test only for y above . 12 . If y is set equal to its maximum likelihood 
estimate, namely 0, ^ is nonsignificant even at the .15 level. Inspection 
of the graphs is facilitated by the dotted lines which mark off the values 
of t, (^ = 106) required for significance at the .01, .05, .10 and .15 
levels for a one-tailed test of the hypothesis that 6=0. (For the four 
control states the alternative hypothesis is that 6 > 0 -) 

The analysis reported in Figure 9 will support neither a confident 
acceptance nor rejection of H; 6 = 0 . The analysis proved sensitive to 
the unknown value of y . H; 5 = 0 can be rejected at the .20 level of 
significance; but a more cautious decision rule, say a = . 01 , would 
not lead to rejection of H; 5 =0 for any likely value of Y . A 
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conservative hypothesis tester would probably view the data as provi- 
ding nor support for the rejection of H: 6 ^ 0. 

B. Analysis for Change in Level of the Massachusetts/ Bhode Island/ 
New York/ and New Jersey Data. 

In Figure 10/ the likelihood distributions and ^-statistics for 
testing H: £ s 0 are presented for the four "control" states. In all 
analyses/ the likely values of the unknown parameter Y fall below .30. 
The maximum likelihood estimates of Y are .01 for both Massachusetts 
and Bhode Island. For New York and New Jersey/ the maximum likelihood 
estimates of y are .16 and .11/ respectively. 

Considering only the value of t. for the maximum likelihood esti- 
mates of Y/ the Massachusetts data yield the only value of 6 which 
differs significantly (p < .01) from zero. The ^-statistics for 
Bhode Island/ New York/ and New Jersey do not attain statistical sig— 
^^ificance at the .15 level with a directional statistical test. 

Considering the value of ;t over the ranges of likely values of y , 
neither the Bhode Island/ New York/ nor New Jersey data present any 
evidence for a value of 6 significantly different from zero. The re- 
sults for the Massachusetts data are equivocal. At the maximum like- 
lihood estimate of y, t is significant at the .01 level. At the 
point on the Y— scale above which approximately half of the area under 
h( y|z) lies, t is significant with an a between .01 and .05 with a 
one-tailed test or .02 and .10 with a two-tailed test. The value of 
t drops below significance at the .10 level for a one-tailed test 
(or .20 for a two-tailed test) above the point on the Y**scale above 
which lies approximately 25% of the area under h(Y|z). 
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None of the analyses of the four control states yields compelling 
evidence of any abrupt change in fatality rate associated with the 
events in that state immediately prior to January, 1966. The evi- 
dence ranges from definitely not supporting the presence of abrupt 
change in the case of Rhode Island to slightly equivocal as evidence 
for an abrupt change in the case of Massachusetts. An alternative 
analysis exists of the multiple-group time-series experiment in which 
Connecticut is compared with the four "control" states. The differ- 
ence between fatalities/100,000,000 miles for Connecticut minus the 
average fatalities/100,000,000 miles for Massachusetts, Rhode Island, 
New York, and New Jersey embodies an "experimental and control" com- 
parison. These 108 differences have been calculated and graphed in 

♦ 

Figure 11. 

One might expect that the analysis of the differences between 
experimental and control time-series would be somewhat less sensitive 
than the analysis employed above in which the ^'s are estimated separ- 
ately for each series and then combined in planned or post— hoc com- 
parisons. The process of taking difference compounds residual varia- 
bility and thereby reduces power. The analysis of the data in Figure 
11 bears out this expectation. 

The maximum likelihood estimate of y for the data in Figure 11 
is .07 (see Figure 13) . Thus one would expect a lag 1 autocorrelation 
for the differences to equal -.50 and the lag 2 and greater 

autocorrelations to be essentially zero. These conditions are reason- 
ably well met, as can be seen by inspecting the correlogram in Figure 



12 . 
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In Figure 13, the likelihood distribution of Y for the 108 obser- 
vations and the ^-statistic for testing the hypothesis that 6=0 are 
presented for values of y between 0 and 2. It is apparent from in- 
spection of the two graphs that S is not significantly different from 
zero over the entire range of likely values for y • 

In summary, the five individual analyses gave no convincing evi- 
dence for an abrupt change in level associated with the Connecticut 
speeding crackdown for any of the time-series. The analysis of the 
differences between the monthly observations for Connecticut minus 
the average for the four control states showed no ev’dence of a treat- 
ment effect in Connecticut. In the following section, a more powerful 
analysis of the data is reported. 

The Analysis of a Planned Comparison of Connecticut with the Control 
States . 

If the time-series for each state can be regarded as independent 
of the others, well-known inferential statistical techniques can be 
employed in making comparisons between Connecticut and the four con- 
trol states. Accordingly evidence was sought concerning the degree 
of dependence among the time-series for the different states.. 

Given the normality assumption of the model in (1) and (2) , the 
independence of the various time-series can be demonstrated if the 
series show no intercorrelation. To reduce the burden of data analy- 
sis without a serious reduction in the sensitivity of the test of the 
hypothesis of no intercorrelation, data for the first 50 months for 
Connecticut, Massachusetts, and New York were used. Using "months** 
as the unit across which correlations were computed, the three 
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intercorrelaiiions of these states v;ere computed for the variable “fa— 
talities/100, 000# 000 miles minus monthly average." The intercorrela- 
tion matrix was as follows: 

Conn. N.Y. Mass. 



1 



-.105 



-.061 



R = -.105 1 

-.061 -.207 



-.207 

1 



(n = 50) 



A test was made of the hypothesis that the 50 triplets of obser- 
vations were a random sample from a tri-variate normal distribution 
in which all intercorrelations are zero (Bartlett# 1950) . The test 



statistic# - C(n - 1) - (2m + 5)/6] log^lRl # is approximately distri 
buted as a chi-square variable with m(m - l)/2 degrees of freedom# 



where m is the number of variables. The value of the test statistic 
for the data in question was 2.877# a value exceeded with probability 
greater than .30 by a chi-square variable with three degrees of free- 
dom. The three series can probably safely be regarded as independent. 
(As a general procedure# when Y may depart appreciably from 0# it 
would be better to intercorrelate the estimated residual errors# the 
a's# using the maximum likelihood estimates of Y for each series.) 

A single planned comparison will serve to evaluate the signifi- 



cance of the change in level of the time-series for Connecticut as 
compared to the changes or lack thereof in the four control states. 
This comparison has the following form: 



-(6 +6 +6 +6 )/4 

^ C M RI NY NJ 



The value of ip is estimated by replacing the parameters with their 
^east-squares estimates;, the variance of the comparison is estimated 






from the common residual variance for the five states multiplied by 
2 2 

[ 1 + 4(1/4) ]. The estimated value of ^ divided by the square root 

of an estimate of its variance follows Student's Jb-distribution with 
5(106)^ when = 0. (See Hays, 1S63, Chp. 14). However, since the 
estimated change of level effects and residual variances differ for 
different values of y , we shall estimate and test the significance of 
the comparison for the maximum likelihood estimates of y for each 
state and for reasonable upper and lower limits to the value of y for 
each state . 

The hypothesis to be tested is that = 0 against the alternative 
hypothesis that ^ < 0. In words, the hypothesis to be tested is that 
the change in level of the Connecticut series is no different from 
the average "change" for the four control states against the alterna- 
tive that it is less. 

Because the y 's are unknown, we shall specify a range between 
which each y probably lies as well as the maximum likelihood estimate 
of each y . The lower limit to the range for each state will be that 
value of Y below which approximately 25% of the area under the like- 
lihood distribution of y lies; the upper limit will mark off approx- 
imately the upper 25% of the likelihood distribution. The data for 
estimating and testing the comparisons appear in Table 1. 
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Table 1 

Values of 5 and aQ) for the IlaKimum Likelihood Estimate and 
Reasonable Upper and Lower Limits of y for All Five States 



State 


Max. Llkell. 


Reasonable Upper 


Reasonable Lower 


Estimate of y 


Limit for y 


Limit for y 


y 


.01 


.10 


.01 


1. Conn. 6 


-.152 


-.594 


-.152 


S(6) 


.176 


.391 


.176 


Y 


.01 


.15 


.01 


.2. Mass. S 


All 


.259 


.472 


a<S) 


.126 


.341 


.126 


Y 


.01 


.10 


.01 


3. R. I. S 


.079 


.326 


.079 


acs) 


.276 


.617 


.276 


Y 


.16 


.25 


.10 


4. N. Y. 8 


.275 


.247 


.337 


0(8) 


.289 


.375 


.233 


y 


.11 


.20 


.07 


5 . N. J. 8 ^ 


.198 


.093 


.331 


0(8) 


.292 


.391 


.236 



Table 2 

Results of Planned Comparisons of^S for Connecticut 

with the Average 6 

for llassachusetts, Rhode Island, New York, and New Jersey 





Max. Likeli. 


Reasonable Upper 


Reasonable 




Estimate of y 


Limit for y 


Lower Limit ; 


A 


-.408 


-.825 


-.457 


% 


.269 


.484 


.241 


t - l)-/oj 


-1.517 


-1.705 


-1.896 


Prob {t530 < t} 


.065 


.045 


.030 
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For a given set of five values of y (one for each state, is 

I 

1 

estimated by subtracting the average <S for Massachusetts, Fhode Island, 

A 

New York, and New Jersey from the value of ^ for Connecticut. The 
residual variance, assumed to be equal for all five states, is esti*- 
mated from the average of the residual variances for all states. The 
values of and ;t which correspond to the maximum likelihood es- 

timates of and reasonable upper and lower limits to Y are reported in 
Table 2 . The bottom row of Table 2 is the probability of a Student 
^-variable falling below the value of . (The probabilities in 
the last row of Table 2 can be interpreted as the smallest levels of 
significance for which the planned comparison' is significantly differ- 
ent from zero with a one-sided test.) 

Conclusion 

It can be seen in Table 2 that one may conclude that there is a 
statistically significant reduction associated with the speeding 
crackdown in fatalities/100,000,000 driver miles for Connecticut as 
compared with the four control states. 

The above conclusion must not be accepted without due considera- 

N 

tion of a source of potential invalidity in the experiment. As Ross 
and Campbell (1965) pointed out, the fact that Governor Ribicoff was 
prompted to take action in late 1955 by the alarmingly high fatality 
rate for that period introduces the possibility of a regression effect 
from the observations immediately preceding his actions to the obser- 
vations immediately fdlowing. If one observes a time-series for a 

period of time and selects that observation which appears quite extreme, 
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subsequent observations are likely to be relatively less 
extreme. The exact extent of any regression effect in the 
Connecticut time— series experiment is difficult to estimate 
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Figure 1. Fatalities/300,000,000 Driver Miles by Months for Connecticut (n =60, 
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Rhode Island PataXit ies/100, 000, 000 Driver Miles T'Jipus Monthly Average Plus 
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Figure 7. Correlogram of Differences for Pre-Treatment (n2^=60) 

Monthly Fatalities/100,000,000 Miles Minus Monthly Average Plus 2 
for Connecticut. 
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Figure 8. (cont 
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Figure 13. h(Ylz) and ^ for Difference Betv;een Connecticut Fatalities 
per One Hundred Million Miles and the Average of the Fatalities 
per One Hundred Million Miles for the States of N.Y., N.J., Mass, by Months 
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as a l^ima-Sorios Ouasi-Sxperinient 



0 . I nfc rodu. c t :l on , 



In 1959, Wolf, Lux 0 and Hax published Sche:^.dunc! una Scnaecunc-srecnr 
(Divorce and Divorces Law, « This v;orh dealt with the effects or revision 



of German divorce iav;s in 1900 on the rates of divorce and petiuion lor 
divorce. Dlisinstein (1959) revievjed their careful study and evaluated 
their conclusions. The present paper is a re-~ analysis of Wolf, Luke 
and Hax's data with newly developed inferential statistical models and 

a reappraisal in light of appropriate statistical analyses of the con- 

\ 

elusions drawn in both of the above works. 



1. The Legislation . 

On January 1, 1900, the nev; Civil Code of the German Empire re- 
placed the various legal statutes then in effect.* The Civil Code 



*The remainder of this section draws heavily upon Scheidung, und 
Scheidungsrecht by Wolf, Luke, and Eax (1959) and "Divorce and the 
Law in Germany: A Review" by Pdieinstein (1960) . 
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If the trend of the pre-T observations is altered sharply by the 
introduction of T/ we are inclined to attribute the alteration (v;hether 
it be a change in level# change in direction of drift, etc.) to T. A 
particularly important problem is to determine whether the activity of 
the time-series in the neighborhood of T indicates a genuine effect of 
T or whether it is merely an orderly continuation of an undisturbed 
time-series. We judge the problem to be "particularly important" be- 
cause the inferential statistical intuitions of. social scientists seem 
seldom to have been developed on non-independent observations (as are 
in evidence in most time-series) , thus formal statistical significance 
tests are a necessary overseer of "considered impressions" we might 
form of the data. 

The divorce rate (divorce/100,000 persons) for the German Empire 
from 1881 through 1914 is plotted in Figure 1, In Figure 2 appears 
the rate of petitioning for reconciliation proceedings for all of Ger- 
many from 1881 through 1913. Both indexes are plotted in Figure 3 for 
those states under the Prussian Code prior to 1900; the same data ap- 
pear in Figures 4 and 5 for the states under the common law and Code 
Napoleon prior to 1900, respectively. 



me 




3. Statistical Analytic Techniques . 

Finding an appropriate inferential statistical analysis of data 
from a time-series experiment has been repeatedly recognized as an im- 
portant problem (Campbell, 1963? Campbell and Stanley, 1963). The data 
in Figure 1 offer an excellent illustration of the need to perform a 

valid inferential statistical analysis in which the probabilities of 
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Protestant marriage. German common law was in effect in 12 states 
prior to 1900. 

Similar in practice to the German common law was the Code Napoleon 
which was in effect in approximately four states. Divorce was granted 
only in cases of guilty misconduct; disruption of a marriage consti- 
tuted insufficient grounds for divorce. (Divorce by mutual agreement 
was a legal possibility but rarely occurred in practice due to burden- 
some legal procedures.) 

Under the new Civil Code instigated in January of 1900/ divorce 
was to be granted solely on the grounds of guilty misconduct by one 
partner (adultery/ desertion, extreme cruelty, etc.). Divorce by 
mutual agreement was abolished. The "enlightened disruptive princi- 
ple" of the Prussian Code v;as totally displaced by the "guilt princi- 
ple" in the new Civil Code. Divorce became far more difficult for 
those who formerly lived under the Prussian Code; it became generally 
easier to obtain for those formerly under the common law (divorce was 
legally available to Catholics in ex-common law states for the first 
time) . 

2 . The Data and the Design . 

Two sets of data are available which bear on the question of what 
effects if any the revision of the divorce laws had: decrees of di- 

vorce per 100,000 population (the divorce rate), and petitions for ini- 

A 

tiation of reconciliation proceedings per 100,000 population (which 

were mandatory under German law both before and after 1900) . These 

data were reported by Wolf, Luke, and Hax (1959) for the period 1881- 

-65- 




of divorce laws. 



The Civil 



brought about a general "tightening up" 

Code divorce ic^ws naa been drafted in a spirit of hostility toward 
divorce and w_cn tna intention of reversing the steadily increasing 

100, OCo r:-iaD 2 .*cents in Germany rose from 
lo8i L.O 17*0 in 1399.) -ander the new law, divorce was to be 
granted only in the case of guilty misconduct; divorce was not to be 
allowed in cases where there was mutual agreement that the marriage 

should be dissolved or where circumstances had thoroughly disrupted 
the marriage. 

The new Civil Code was uniform across the German states, whereas 
divorce laws in effect in the various states prior to ISOO were of 

general types. The possible effects of the nev; Civil Code coulc 

d>_pended upon uhe particular divorce laws in effect before 1900; 

thus it will bo advisable to analyse the effect of the new Civil Code 

on the divorce rate for three groups of states ™ corresponding to 

the three types of pre-1900 legislation - as well as for the German 
Empire as a whole. 

Approximately eight states were under the divorce laws of the 
Prussian General Code prior to 1900. The Prussian Code was the most 
lenient as regards divorce. Divorces were granted in cases of mis- 
conauct, mu^-ual agreement, and even upon grounds of "insuperable aver- 
of one par..y lor the other. The Prussian Code recognized "dis- 
ruption" of the marriage beyond repair as grounds for divorce. 

In contrast to the lenient Prussian Code, the German "common law- 
embodied ecclesiastical law concerning divorce. Catholics could not 
divorce, and only grave misconduct was grounds for dissolution of a 
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incorrect decisions can be knoivii exactly and controlled. The data in 
Figure 1 appear to show the expected drop immediately after the change 
of legislation in 1900.* In fact, the raovement of the divorce rate 
index is larger between 1899 and 1900 than between any other pair of 
years. However, the 3 point drop between 1899 and 1900 is only 0.2 
larger than the 2.8 rise between 1881 and 1882. It would seem in- 
cautious, then, ^to attempt to draw any conclusions by mere inspection 
of the data or by the application of intuitive judgment. 

Wolf, Luke, and Hax (1959) considered analyzing the data in Fig- 
ures 1 through 5 by fitting least-squares regression lines (dependent 
variable-divorce rate; independent variable-year) to the pre-treatment 
and post- treatment data separately and testing "whether the two lines 
connect" or whether the datum for 1900 appears to be a simple extra- 
polation of the pre-treatment regression line. Their suggestion is 
equivalent to the "Mood-test" suggested by Campbell and Stanley (1963, 
p. 213). Wolf, Luke, and Hax recognized the shortcomings of their 
suggestion and refrained from any inferential statistical- analysis. 
They stated their concerns about statistical procedures for analyzing 
their data as follows**^: 



*There are 18 observations for the 19 years from 1881 to 1899. Divorce 
data were not available for the entire German Empire in 1892 and 1893. 
The observation graphed half-way between 1892 and 1893 is an estimate 
determined in Wolf, Luke, and Hax (1959) . 

**The following passage was rather freely rendered from the original 
German, but it is substantially correct. 
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"If the post-treatment regression line connects directly with 
the pre-treatment regression line, then the change of laws has not 
brought about a shift of level. If, however, it lies higher or 
lower, the possibility exists that v;e are dealing with an effect 
of the new laws .... 

"Where in fact a material shift can be established, it is a 
question of ascertaining whether this shift is to be ascribed to 
the influence of the change of laws. This can only be accepted 
when the data show that the shift of level occurred exactly be- 
tween 1399 and 1900. 

"One could suppose that the data are randomly distributed 
around the regression line. The shift in level between 1899 and 
1900 could be regarded as significant under this assumption if 
the datum for 1899 lay within the chance region surrounding the 
pre-treatment regression line but the datum for 1900 fell outside 
this chance region. One could run the alternative test and estab- 
lish whether the datum for 1900 lies within the chance region of 
the post-treatment regression line and the datum for 1^^9 lies 
outside of the chance region around this regression line. 

"If one were to proceed in this way, then one would have to 
make use of the standard deviation, a , in ascertaining the limits 
of the chance region. The standard deviation of a series of 
values is given by the (Quadratic mean of all deviations of the 
individual values from the arithmetic mean. In the case of a 
regression line, the deviations of the data from the correspond- 
ing predicted value take the place of the deviations from the 
arithmetic mean. If there exist reasons for assuming that the 
data are distributed as a Gaussian (normal) distribution, then 
the probabilities would amount to .6827, .9545, and .9973 that 
a value deviates less than one, two and three standard deviations, 
respectively, from the mean. For a deviation of a value from the 
mean of more than three standard deviations it could be assumed 
rather safely that a special influence instead of a chance fluc- 
tuation is being exhibited. The same conclusion would no doubt 
be clear if the deviation were merely two standard deviations 
from the regression line. 

"It can not be assumed, however, that the process is repre- 
sentative of the present case. Can it be assumed that the fluc- 
tuations of the number of divorces are the result of a neutral 
and unchanging law of a random distribution? Is the number of 
divorces described as the result of a series of mathematically 
isolatable factors? Is the chance region into which this number 
must fall unequivocally determined when these factors remain con- 
stant? The answers to these questions have been given in part 
previously. (See page 12 in Wolf, Luke, and Hax (1959) .] There 
exists little inducement to assume that the data are distributed 
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around the regression line according to a constant mathematical 
distribution law let alone according to the law for the Gaussian 
normal distribution. Hence, the calculation of chance regions 
in this connection appears to be senseless.'* 

(Wolf, Luke, and Hax, 1959, pp. 129-132.) 

Wolf, Luke, and Hax seem overly concerned about the validity of 
the assumption of a normal distribution. And in fact, in the passage 
quoted and on page 12 of their book they express reservations about 
the validity of any stochastic model as a representation of a social 
system. They appear to argue that "chance" is an inadequate explana- 
tion of social phenomena for which we can find explanations, and they 
appear to draw some gratuitous connection between the normal distribu- 
tion and chance phenomena. We can with good success predict and "ex- 
plain" human stature; the fact that height tends to be normally dis- 
tributed in adults does not mean that stature is the result of unknown, 
chance influences. 

: 

A valid inferential statistical analysis is available for time- 
series experiment data, but it is more difficult than fitting and 
extrapolating least-squares regression lines. 

In 1965, Box and Tiao developed a method of evaluating the change 
in level between two successive points in time of a non- station ary time- 
series. Observations are taken at equally spaced time intervals and 
one wishes to make inferences about a possible shift in level of the 
time-series associated with the occurrence of an event at a particular 
point in time. If there is an abrupt shift in the level of a time - 

series between the third and fourth observations, evidence of a treat- 
ment effect may exist. 
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The statistical model underlying the Box-Tiao analysis of change 
in level of a time-series was the integrated moving average model. 

t-1 

« L + o, and z «L + y 2 a ,+a (1) 

A It t-1 t 

for the 11 ^ observations prior to he introduction of T, and 

t-1 

.z^»L+6 + y 2 ■*■ “i (2) 

i=l 

for the n^ * N - n^ observations following T, where: 

2^ is the value of the variable observed at time t, 

L is a fixed but unknown location parameter/ 

Y is a parameter descriptive of the degree of interdependence 
of the observations in the time-series and takes values 

0 < Y < 2, 

is a random normal deviate with mean 0 and variance o2 , 

5 is th e change in level of the time-series caused by T . 
Essentially the model implies that the system is subjected to 
periodic random shocks/ , (with zero mean) a proportion (y) of which 
are absorbed into the level of the series. Data which conform to the 
model in (1) and (2) are such that the graph of the time-series follows 
an erratic, somewhat random path with slight/ but no systematic drifts , 
trends, or cycles. Data which show a systematic increase or decrease 
over time — such as population and various growth curves — violate 
the assumption of zero mean for the random variable a .. For generality/ 
the random variable portion of the model can be allowed to assume an 

expected value other than zero; thus "drifting" time-series — those 

showing a constant rise or fall over time — can be accommodated. The 
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cfensralization of tiliG modGl in (1) and (2) is callsd tlis "intGCfratod 
moving avGrago modGl with dGtGrministic drift"* and takes the following 



form: 



t-1 

2 ® L + 3. and zs«L + y2 3 +3> 

l t t-1 t** 



(3) 



for the observations prior to the introduction of T, and 



t-1 

2 «L+6+y £ B 
* i=l 



, + 3 > 

t-1 t 



(4) 



for the n 



- N - n ^ observations following T, 



where L, y d 3^e interpreted as in the model in (1) and (2) , 

but now £ is a normal variable with variance and mean ecrual 



to 



The parameter y describes the rate of ascent or descent of the 
time-series. 

It is illuminating to express 3 as y + a and manipulate (3) 
into a form similar to (1) : 

t-1 

2 ® L + pY(t-l) + P + Y 2 a ^+a (5) 

i«l ^ 

One sees by inspection of (5) that the time-series in (3) will be 
expected to have "drifted" pYt units at time 

In the setting of the time-series quasi-experiment, interest cen- 
ters on estimating 6 in (4) and testing its significance. The 



*The "integrated moving average model with deterministic drift" was pre- 
sented by G.E.P. Box and G.M. Jenkins on pp. 33-34 of "Models for Pre- 
diction and Control, III.. Linear Non- stationary Models," Technical 

Report No. 79. Madison: Dept, of Statistics, University of Wisconsin, 

July, 1966. 
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following steps lead to the least-squares estimate of 6 and its distri- 



bution. 



t-2 



By setting and y^. * “ Y ^ (1“Y)^ model 

J ^ 



can 



be written as Y •* X 0 •+■ e where X is defined as an If x 3 matrix of 



weights as follows: 
1 1 



T 

X^ - 



• • • 



I 



• • • 



I 



,N-1 



1 (1 - y ) • • • (1 - y )”^~^ J *“ y ) 

0 • 1 (1 - y ) • • • “ Y )^2 



• • • 



0 is a 3 X 1 vector such that 0^ * (y, L, 6) ; and e is an N x 1 vector 

T 



t the elements of which 



of random normal deviates, e 
have mean y and variance a^. 

When Y is known, simple least-squares estimates of y, L and 6 can 



be found from the familiar solution to the least-squares normal equations : 



0 



6 



T .. -1 T 
(X^X) ^ X Y 



( 6 ) 



The least-squares estimates in (6) each have a ^-distribution with 
jl - 3 ^ when divided by appropriate estimates of their standard error. 



In particular, 

(S - — 

<L-0/<.’CS')-Vr - 

(6 - 6)/(s 4^) where 



(7) 

( 8 ) 
(9) 



^ (Y^Y - 0^X^X0) / (N-3) and c^^ is the jth diagonal element of (X X) 



The above results follow from the linear model Y « X 0 + e in which 



the errors, e, are assumed to be normal, homos cedas tic, and independent. 
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Tli 0 QU3ntij.1iy s is tiliG JTGsidusX vsx’i.sncs/ th© varisnc© of y 

aft 0 r th© modal X 0 is fitt©d to it. 

All of the above operations on the linear model are made for a 
given value of y . VJhen y is unknown (as will generally be true) a 
Bayesian analysis using sample information about Y is used in making 
inferences about 6. The posterior distribution, h(y|z), of Y given 
a set of N observations and assuming a uniform prior distribution is 
known to within a constant of proportionality. The posterior distri- 
bution of Y assuming a uniform prior (in which case the posterior 
distribution is equivalent to the likelihood distribution of y ) is 
given to within a constant of proportionality by the following formula: 

Illustrations of how the posterior distribution of y in (11) is 
considered jointly with 6 in making inferences about 6 for the simple 
integrating moving average model in (1) appear in Box and Tiao (1965) 
and Maguire and Glass (1967) . 



Data Analysis a nd Results . The data in Figure 1 were subjected to 
the analysis outlined in Section 3.* In Figure 6 appear graphs of the 



*One condition which data following the model in (4) must satisfy is 
stated in terms of the correlogram of the differences between succes- 
sive observations, i.e., The lag 1 autocorrelation coeffi- 

cient should approximate - (l-y) /U+(l-Y )^] , and lag 2 and greater 
autocorrelation coefficient*© of -z^ should approximate zero for 
tne 18 pre-treatment observations, the lag 1 autocorrelation of the 
17 differences ^^^^^-z^was 0.127, which corresponds reasonably closely 
to the expected value — calculated from the maximum likelihood esti- 

0*100. The lag 2 through lag 9 autocorrelations were 
.027, .033, -.419, -.013, .003, -.434, -.504 and .155, respectively. 
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likelihood distribution of y and the jt-statistic in (7) for testing 

the significance of the deviation of 6 from a hypothesized value of 0. 

Nearly all of the mass of the likelihood distribution of y is contained 

between the values 0,50 and 1.90.' The maximum likelihood estimate of 

the unknown y is approximately 1.13. The value of 6 /a (6) is 

clearly significant •— it is never greater than -4.50 — over the 

entire range of likely values of y. The hypothesis 6 = 0 can be 

confidently rejected in favor of the alternative that 6 < 0. Thus we 

• ^ 

see that the downward shift of the rising divorce rate after 1900 was 
quite statistically significant; chance can safely be discounted as 
the e:q>lanation of the downward movement of the time-series after 1900. 
Inspection of Figure 1 seems to indicate that the effect of revision of 
the divorce laws was temporary. The conclusion that the effect of the 
change in legislation was temporary depends upon the perhaps gratuitous 
assumption that the trend from 1881 to 1913 would have been linear (as 
opposed to curvilinear) in the absence of legislative change. 

The results of the analysis of the data in Figure 2 appear as 
Figure 7. The dotted lines on Figure 7 indicate the values below 
which t must fall to allow rejection of 6 « 0 in favor of 5 < 0 at the 
•05, .025 and .005 levels of significance. The graphs of h(y|z) and jt 
present a picture of somewhat marginal statistical significance. The 
value of t is significant at the .05 level and beyond for y above 1.09. 
The fact that approximately 80% of the likelihood distribution of y 
exceeds 1.09 lends support to rejection of 6 » 0 in favor of 6 < 0 
at a respectable level of significance. 

The analysis in Figure 7 of the petition for reconciliation rate 
data in Figure 2 is particularly interesting in that visual inspection 
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of the time-series leaves an impression of no treatment effect which 
is at variance with the results of the statistical analysis. Wolf, 

Luke, and Hax (1959) and Rheinstein (1959) concluded that the revision 
of the divorce laws in 1900 had no effect on the rate of petition for 
reconciliation. It is difficult to interpret whether these authors 
are using the terms “no effect" to mean "no statistically significant 
effect," "no socially significant effect," or "no permanent effect." 

In the first sense, one could reasonably take issue with the conclusion 
of "no treatment effect." It is not our purpose to argue the validity 
of conclusions of "no effect" in the second and third senses. 

In Figure 8 are presented the analyses for change in level of the 
divorce rate and the petition for reconciliation rate for the German 
states grouped by type of legislation prior to 1900. These analyses 
will be summarized below for the three groups of states in turn. 

The average divorce rate and petition for reconciliation rate for 
12 Prussian Code states are graphed in Figure 3. The graphs of the 
data create a distinct impression of a strong effect due to the revision 
of legislation in 1900. The analyses graphed in the left portion of 
Figure. 8 substantiate the statistical significance of the observed down- 
ward shifts in the divorce rate and the petition for reconciliation rate. 
For the divorce rate, the value of t, = 6/o(6) is never greater than 
-3.90; t. is approximately -4 at the maximum likelihood estimate of y . 

It can be confidently concluded that the divorce rate shifted its level 
downward at 1900. The petition for reconciliation rate also showed a 
significant downward shift at 1900; ^ was less than -3 for all likely 

values of y , as can be .seen in the upper- left portion of Figure 8. 
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As i^as pointed out earlier, the new Civil Code instigated in 1900 con- 
stituted a “tightening" of divorce laws in those states previously 
under the Prussian Code. Introduction of the new legislation should 
have worked a negative effect upon the divorce and petition for recon- 
ciliation rates. Such effects are reliably observable in the data. 

The average divorce and petition for reconciliation rates for 
eight Common Law states are graphed in Figure 4. Inspection of the 
behavior of both time-series in the vicinity of 1900 would probably 
lead to no confident conclusions about the possibility of treatment 
effects. The petition for reconciliation rate increases from 1899 to 
1900, but not dramatically so. The decrement in the divorce rate from 
1899 to 1900 is even less dramatic, and can not be confidently ruled 
out as a chance occurrence by mere inspection. The analyses for change 
in level of the petition for reconciliation and divorce rates at 1900 
appear in the middle-upper and -lower portions of Figure 8. In the 
lower-middle portion of the figure, the graphs of h(Y|z) and t appear 
for the divorce rate. The value of ;fc is less than -3 for all likely 
values of y • Hence, the rather small downward shift in the divorce 
rate is nonetheless statistically significant and can not be reasonably 
attributed to chance. The shift in level of the petition for reconcil- 
iation rate is equally statistically significant for the Common Law 
states; however, whereas there was a decrement in the divorce rate at 
1900, there was a statistically significant increment in the petition 
for reconciliation rate. It should be recalled that under the new Civil 
Code, divorce became legal for Catholics in Common Law states for the 
first time. One might speculate that the data support the conclusion 
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that the new Civil Code brought about petitions for reconciliation 
from Catholics in the Common Law states, but that the courts held to 
their newly enacted unsympathetic attitude toward divorce. Of course, 
such speculation goes far beyond the data. 

The average divorce and petition for reconciliation rates for four 
states under the Code Napoleon prior to 1900 are graphed in Figure 5. 
There appears to be a downward shift of level in the divorce rate at 
1900; however, the petition for reconciliation rate does not appear to 
have been affected by the introduction of the new Civil Code. This 
latter observation was borne out by the failure of ;t * 6/S(S) to 
attain significance in the test for a change in level at 1900 of the 
petition for reconciliation rate. As can be seen in the upper-right 
portion of Figure 8, t falls below +2 for approximately the upper 75% 
of the likelihood distribution of y ; at the maximum likelihood esti- 
mate of Y, namely 0.67, the value of t is +1.10. The data on the 
divorce rate for the Code Napoleon states show a statistically signi- 
ficant downward shift at 1900 (see the lower-right portion of Figure 8) . 
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Conclusions 

The conclusions we shall draw from the above analyses will be at 
variance with those drawn by Wolf, Luke^ and Hax (1959) and Rheinstein 
(1959) • With respect to petitions for reconciliation proceedings. Wolf, 
Luke and Hax concluded the following: 

The introduction of the new Civil Code [in 1900] has not re- 
duced the increase of the number of petitions for conciliation 
proceedings and has thus not reduced the extent of the divorce 
desire. Preponderantly the new law has not haa any effect in 
this respect. In some regions in which the divorce law was liber- 
alized one can observe a certain increase of the trend. It is by 
no means certain, however, whether this increase would not have 
occurred independent of the change in the law. Nowhere was the 
progressive trend retarded. Even in the regions of the Prussian 
law, where the divorce law was tightened, the trend did not change 
in any significant way.* 

Rheinstein (1959, p. 493) observed that Wolf, Luke and Hax were 
“certainly justified in concluding that the draftsmen: of the new code 
have failed in their expectation of reducing the desire for divorce." 

With respect to the divorce rate. Wolf, Luke, and Hax concluded 
that "the shape of the law of divorce was neither the cause of the 
divorce wave nor even one of its essential conditions. In the face of 
other circumstances, the influence of the law did not make itself felt 
at all." 

Rheinstein (1959, p. 495) concurred: 

... before 1900 the [divorce rate] was rising in the dis- 
tricts of most and, since 1900, in those of all appellate couirts. 

In a few court districts the trend shows a slight downward break 
in 1900. The majority of the latter districts belongs to the 
region of the Prussian Code, but there are among them also two 
districts of Protestant common law. In all these districts the 



*The translation is du e to Rheinstein (1959, p. 493) . 
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break is smalls and the trend rose continuously after* I90d, While 
the break in the Prussian law districts may be attributed to the 
change of the law^ it was insignificant and without lasting effect . 
Nowhere did the change turn the trend downward; and nowhere did 
it prevent its continuous rise. (Italics added.) 

Both Scheidung und Scheidunasrecht and lOieinstein* s review leave 

us with the conclusion that 

The experiment made by makers of the Civil Code refutes the 
notice (sic] that a limitation of the statutory catalogues of 
grounds for divorce. to situations of guilt could result in a re- 
duction of the number of divorces or even in their rate of in- 
crease. On the other hand, the present Marriage Law (of 1938] 
has refuted the apprehension that the introduction of the dis- 
ruption principle would naturally result in an increase of divorce. 
No causal or even statistical connection exists in one direction 
or the other . 

Rheinstein saw Wolf, Luke and Hax*s work as confirmation of 
Willcox’s conclusion that “the immediate, direct and measurable in- 
fluence of legislation is subsidiary, unimportant, almost imperceptible.” 
(Willcox, 1897).. 

We contend that the conclusions just stated make an unfortunate 
use of the word “significant" and that they depend for their validity 
upon extrapolations of pre-1900 trends for which there exist as a basis 
neither compelling logical reasons nor convincing empirical evidence. 
Furthermore, we feel that the only . conclusion which may be drawn from 
the data with confidence is that the effect of the introduction of the 
new Civil Code in 1900 is clearly reflected in both the divorce rate 
and the petition for reconciliation rate. 

All too frequently, social scientists extended the meaning of the 
term "significant" beyond its strictly appropriate sense as it applies 
to statistical hypothesis testing and made unwarranted interpretations 



of social value, merit, or importance of data when they are merely in- 
ferentially reliable — the appropriate meaning of "statistically sig- 
nificant." Having been disabused of this confusion in our enlightened 
age, social scientists now react quite cautiously to the words "signi- 
ficant" or "insignificant" when they are applied to data; they are 
careful to read inferential reliability into the word and nothing else 
Thus, Bheinstein risked serious misinterpretation of the facts when he 
chose to call the break in the divorce rate curve at 1900 "insignifi- 
cant" without the benefit of a valid statistical analysis and without 
apprising the reader of the value system against which he judged the 
downward shift to be without social value or importance — the populat 
sense of "insignificant." We have shown that the changes in level of 
the divorce and petition for reconciliation rates around 1900 are 
statistically significant (with the exception of the petition for re- 
conciliation rate in the Code Napoleon states) . It does not seem 
justifi^le to refer to the shifts in level as "insignificant" in any 
inferential statistical sense. 

It was also concluded above that if any effect of the 1900 revi- 
sion of the divorce laws did occur it was "temporary" or "without last 
ing effect." It was claimed that granting a remote possibility of an 
effect of the new Civil Code the graphs of the divorce rate and the 
petition for reconciliation rate quickly returned to a trend line one 
could extrapolate from the pre-1900 trends. Such a casual impression 
can be "read into" the graphs in Figures 1-5, though in most instances 
it is equally easy to confirm an impression of the decrement accruing 
during 1900 lasting through 1914. However, both impressions are 

- 80 - 






uncritical. Why must one assume that a somewhat linear trend from 1881 
to 1899 should continue from 1900 to 1913 or 1914? The answer is of 
course that one need not* In fact, to do so is a matter of faith. One 
could argue that the new Civil Code was instrumental in preventing an 
exponential increase of the divorce and petition for reconciliation 
rates after 1900;* But to argue either point goes beyond the data. 
Without comparable "control groups" — states like those in the German 
Empire whose divorce laws were not revised in 1900 — no unequivocal 
answer can be given to the question "What would the post~1900 trend 
of the divorce and petition for reconciliation rates have been?" 

Previous discussions of the data in Schieidunq und Scheidungsrecht 
by Pheinstein and Wolf, Luke and Hax have discredited the one conclu- 
sion which can be drawn with defensible validity. The time-series 
quasi-experiment rivals the completely randomized experimental design 
for validity in some instances. But the inference which enjoys a 
healthy measure of validity concerns an instantaneous shift in the 
level of the time-series at the introduction of the experimental treat- 
ment and not suppositions about how the time-series should behave long 
after the treatment has been introduced. There appears to be little 
doubt that the revision of the divorce laws in 1900 did produce statis- 
tically significant effects on bcth the divorce and petition for 

*In the U.S. the divorce rate was rising at a faster rate during the 
second half of the period from 1887-1917 than it was during the first 
half of that period. The divorce rate rose 31 points (47 to 78) from 
1887 to 1902, but it rose 42 points (78 to 120) from 1902 to 1917. A 
comparison of the pre-1900 and post-1900 trend lines for both the di- 
vorce rate and the petition for reconciliation rate in Germany reveals 
about the same acceleration of the rates after 1900. 




reconciliation rates. Whether the effects were temporary or relatively 

permanent cannot be determined validly from the available data. The 

« 

supposition that the effects were temporary should not be cited as 
though it somehow calls into question the one conclusion for which 
convincing evidence exists/ namely that both the divorce and petition 
for reconciliation rates show the evidence of adoption of the new Civil 
Code in 1900. 
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Figure I. DIVORCE RATE FOR GERMAN EMPIRE (1881- 
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Figure 2. PETITION FOR RECONCILIATION RATE FOR GERMAN EMPIRE (1881-1913) 
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Figure 3 DIVORCE AND PETITION FOR RECONCILIATION RATES FOR STATES UNDER PRUSSIAN CODE PRIOR TO 1900 
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Rgure 4. DIVORCE AND PETITION FOR RECONCILIATION RATES FOR STATES UNDER COMMON LAW PRIOR TO 1900 
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Figure 5: DIVORCE AND PETITION FOR RECONCILIATION RATES FOR STATES UNDER CODE NAPOLEON PRIOR TO 190 
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Figure 6. ANALYSIS FOR CHANGE IN LEVEL AT 1900 OF DIVORCE RATE FOR GERMAN EMPIRE (DATA IN FIGURE I) 
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Figure 7 ANALYSIS FOR CHANGE IN LEVEL AT 1900 OF PETITION FOR RECONCILIATION RATE FOR GERMAN EMPIRE (DATA IN FIGURE 2). 



Chapter V 



AN ANALYTIC TECHNIQUE EMPLOYING A CHANGE IN DRIFT 
OP A TIME- SERIES ASSOCIATED WITH 
THE INTRODUCTION OP A TREATMENT 

/ 

The model of Chapter IV with its procedures for the 
estimation and significance testing of both the constant 
drift, y, of a time-series and its instantaneous shift in 
level, 6, at the introduction of a treatment provides a 
powerful tool for analyzing a large class' of time-series 
esqperiments • In this chapter, we shall report on an even 
more general model and analysis which were developed by 
Dr. George C. Tiao for use in this project. 

The Problem 

It may occur that the effect of the introduction of a 
treatment, T, into a time-series does not result in an 
instantaneous change in the level of the series but does 
change the direction of its drift. The series of observa- 
tions in Figure 1 evidence no change in level at T but a 
change in direction of drift. In Figure 2, both changes 
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at T are present. 
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Figure 1. A Time-Series Showing a CSiange of Direction of 
Drift but No Change in Level at T. 
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Figure 2. A Time-Series Showing Both a Change in Level 
and a Change of Direction of Drift at T, 
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The model of Chapter IV can be modified so that a para- 
meter descriptive of a change in y, the drift of the series, 
is incorporated. We shall see how it is then possible to 
estimate all of the parameters in the model for a given 
value of Y and to test hypotheses about each. Finally, the 
likelihood distribution of y can be found for a set of n 
observations for use in inferential analyses of 6 , y, and 
the change in u. 



The Model 

Let denote the observation of a series at time t 
The following model is proposed for the n^^ observations 
prior to the introduction of a treatment T: 



t-1 



2 ^ ■ L + Yw(t-l) + y + Y + Y 2 + a^. 



( 1 ) 



where the interpretation of the elements of the model are 
identical to their interpretation in Chapter IV. The fol- 
lowing model is put forward as descriptive of the behavior 
of the series for the XI 2 obsearvations following the intro- 
duction of T: 



t-1 



■ I* + Yw(t-l) + y + Y^(t“i^i“l) + A + Y 2 a. + a + 

j-1 ^ ^ 



5 , 



( 2 ) 




where 6 is the change of level of the series between times 
and and A is the change in the drift of the 

series between these two times. Prior to the series 
drifts (on the average) at a rate of yp units (up or down 
depending on the sign of jj) for each unit of time; after T, 

the series drifts y(p + A) units on the average for each 
unit of time. 



Analytic Procedures 



As before, a collection of n^^ + n^ observations of 
2 are made; these values of are then transformed for a 
given value of y as follows: 



?! - 



y* - 



- y I (1-y)^"^ z, . 
j-1 



( 3 ) 

£ot 2 + n ^. 



The (iij^ ■‘•ilj) 1 vector of y's can be expressed as a 

linear model in terms of the design matrix X, the vector of 
parameters e’’ - (l, 4, y, 4) and the vector, e, of observa- 
tions of a random normal variable with variance o^, as in 
Chapter IV: 
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y « X0 + e 
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For a single value of y, the least-squares estimates 
of the parameters in 0 are obtained from the equation 



P 

A 

A 

A 

L 

A 

6 



(x'^X)“^ X^y. 



(4) 



The "residual variance” in fitting the model in (1) 
and (2) to the observations is given by 

- [(y - X0)^(y - X0)]/(iij^ + - 4). 



(5) 
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The following distributional statements about the esti- 
mates of the parameters follow from the assumption of nor- 
mality of a and traditional sampling theory: 
t 

A 

y - y 

4 *ni-ta2-^' 



A - A 



^ ^1+112-4' 



( 6 ) 



L - L 






6-6 

/ / T t . , , where 

44 iin+n«-4’ 



sljc 



‘r“2 



Is the jth diagonal element of (X'^X) 



'p »i 



The above calculations are performed for a single 
value of Y which is restricted to the open interval (0/2). 
Since y is generally unknown, information regarding its 
likely values must be found from the data themselves. The 




















likelihood distribution of y given the n^ + n^ observations 
2 ^ is given — • to within a constant of proportionality ~ 
by formula (7) : 

h(Ylz) « s-(“1+“2-3)^ 

Suppose one wishes to test the following hypotheses at 
the a-level of significance: 

Hq.* 6 - 0, Hq: a » 0. 

H^: 6 0, ^1* ^ ^ 0- 

In this instance, the investigator wishes to ascertain 
whether or not the introduction of a treatment between time 
and time ^ results in either an instantaneous shift 

of level of the series, an instantaneous shift in the direc- 
tion of the drift of the series, or both. 

For each value of y between 0 and 2, the values of 6, 

A and s^ are obtained using formulas (4) and (5) . The null 

hypotheses H^: a = 0 and a * .0 are tested with the t- 
statistics t - S/Cs^fc^ * ) and t » A/(s'\| c^^ ) which both have 
Student's t- distribution with + 1 I 2 - 4 degrees of free- 
dom if the null hypotheses are true. These values of t are 
likewise calculated for each value of y between 0 and 2. 
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Next, the likelihood distribution of Y is determined and 
the graph of t for I and the graph of t for S are inspected 
over the range of Y*® which are shown to be likely by 
inspection of h( yIz) • It is determined whether the value 
of t for either 5 or A tends to be significant [i.e., out- 
side the 100 (o/2) and 100(1 - a/2) percentile points in the 
^-distribution with nj^ + 1 I 2 ” 4 or nonsignificant over 

the range of likely values of y. Of course, very large (in 

^ 1 ■■ ■ 

absolute value) values of t = 6/(s\[c^ ) or t « A/(s"\lc ) 
lead to rejection of Hq: 6 = 0 or Hq: ^ =0, respectively. 

Computer Programs 

Computer programs for the analyses in Chapter IV and 
this chapter appear as Appendices A and B, respectively, 
to this report. Source dechs for both programs are avail- 
able upon request from the authors. 
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Appendix A 

Computer Program for Analysis of 
Time- Series Experiment with Constant Drift 
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Appendix B 



Computer Program for Analysis of 
Time-Series Experiment with 
Possible Change in Drift 
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n r> rvn r» n n n r> 



IME SERIES I NCORPORAT Ii\G ^hamGE I .N DRIFT 



PARAMETER CARD ONi 



COLS • 



1 -.2 



RR=NUMBER OF PROBLEMS TO BE RUN 



CARDS ARE REQUIRED FOR EACH PROBLEM TO BE RlJN 



CARD 

CARD 

CARD 

POST 

CARD 



? COLS. 2-80 TITLF 

3 COLS 1-80 format 

4 COLS. 1-4 NUMBER 
treatment MEASURES- 



OF PROBLEM 

data 



FOR 



OF PRE TRE^-^' 



CARDS.. IN THE FORM (FORMAT) 
MENT measures. COLS. 5-8 



COL 



rv 

1 IF 



CORRELOGRAM REQUIRED 



NUMBER OF 
0 OTHERWISE 



5 etc. data cards FOR FIRST PROBLEM NO SEPERATION BETWEEN PRE AND POST 



TREATMENT DATA 

DIMENSION Z(50C) ,Y (50( ) ,X(50C94) ,XTXIN{4»4) »XTX(4,4) 9XTY(4) 
DIMENSION THETA (4) 9FMT( 18)9TITLE(18)?PD(20C) ,XOUT (201,11) 

D I MENS I ON SE(4),T(4),B(4,1) 

DIMENSION XX(200,10) ,DELCON ( 200 , 5 ) ,FNAM(5,5) ,G1 (5) ,62( 5) ,G3(5 ),G4( 
15) ,G5( 5) ,G6(5 ) >G7( 5 ) ,G3 ( 5 ) ,G9( 5 ) ,G10( 5 ) 

DATA G1/20HT. -CHANGE IN LEVEL / 

DATA G2/20HT. .CHANGE IN SLOPE / 



DATA G3/20HSCALFD POSTERIOR / 
DATA G4/20HL0WER 99 PERCENT / 
DATA G5/20HLOWER 95 PERCENT / 
DATA G6/20HDFLTA / 
DATA G7/20HUPPER 95 PERCENT / 
DATA 58/20HUPPER 99 PERCENT / 



XXCK=1.0E-15 
NRB = 0 

RFAD(5,36)NR 

36 FORMAT (12) 

39 READ(5,1 ) (TITLE( I ), 1=1,13) 

1 FORMAT ( 18 A4) 

JM=n 

NTT=0 

G=0.01 

NRR=NRB+1 

WR!TF(6,37)NRB 

37 FORMAT (//9H PROBLEM 12//) 
WRITE(6,1) (TITLE( I ) ,1=1,18) 

READ ( 5, 1 ) (FMT ( I ) , I =1, 18 ) 

READ ( 5,3 ) N1 ,N2 ,NCC 

3 FORMAT (2 14, II ) 

NT0T = N1-HM2 

READ( 5,FMT ) (2 ( I ) , I =1,NT0T ) 

WRITE (6,601 ) N1»N2 

601 FORMATC //8X,4H N1=I3,4H N2=I3//) 
WRITE ( 6,602 ) 

602 ;0RMAT(15H INPUT DATA) 

WRITE ( 6,600 ) (Z( I ), 1 = 1 ,N1 ) 

600 FORMAT (5 ( 1XE13.5) ) 

NNN=N1-^1 

WRITE (6,603) (Z( I ) ,I=NNN,NTOT) 

603 FORMAT ( //5 ( 1XE13.5 ) ) 

IF( NCC ) 702,701,702 

702 CALL CORREL (N1,N2,Z) 

701 CONTINI^E 

WRITE (6,666 ) 



666 format: IHO) 
WRITE(6,22 ) 



22 


FORMAT ( 1X114H 
1 FOR 

WR ITE( 6,2022) 


RESIDUAL 
T FOR 


CHANGE IN 


T FOR 
T FOR 


CHANGE IN T 
SCALED ) 


2022 


FORMAT ( 1X117HGAMMA 
1 CHANGE SLOPE 


VARIANCE 

SLOPE 


LEVEL 

SLOPE 


LEVEL 

CHANGE 


LEVEL 

POSTERIOR) 



er|c 



* c 
c 



.45 

40 



42 



41 

5 



C 

C 



1000 



2000 



25 CONTINUE 

CALCULATION OF Y SCORES FROM THE DATA ..CHECK 
IS 1.0 AND Y(N) IS NEARLY EQUAL TO Y(N~1) 
Y(1)=Z(1) 

DO 5 I=2»NT0T 
11 = 1-1 

YY=ABS(Y(in) 

IF( YY-XXCK)4? *42 »45 
IF(YY-. 000001 )40,40,41 
GG=ARS( 1.0-G) 

IF(GG-.001)42*42»41 
Y(I)=Z(n-Z(II) 

GO TO 5 

Y( I ) = (Z( I )-Z{ 1 1 ) ) + { 1.0-G )*Y( 1 1 ) 

CONTINUE 

CALCULATION OF WEIGHTS. IF ABSOLUTE VALUE OF X 
IS SET EQUAL TO ZERO TO PREVENT UNDERFLOW 
DO 1000 I=1»NT0T 
xn»i)=i.o 
DO 2000 I=1»N1 
X( I»2)=0.0 
NNN1=N1+1 

DO 2001 I=NNN1»NT0T 
X( I »2) = 1.0 
X(l»3)=1.0 
X(2,3)=1.0-G 
DO 6 I=3,NTOT 
11 = 1-1 

X( I,3)=X(2.3)*X(II .3) 

XXX=ABS(X( I »3 ) ) 

IF(XXCK-XXX)6»6»32 
X( I .3 )=0.0 
CONTINUE 
DO 7 1=1, N1 
X( I,4)=0.0 
NN=N1+1 

DO 8 I=NN,NTOT 
I I=I-N1 

X(I,4)=X(II,3) 

XXX = ABS{ X(I ,4) ) 

IF(XXCK-XXX)8 ,8,33 
X(I,4)=0.0 
CONTINUE 
DO 11 1=1,4 
DO 11 J=l»4 
XTX(I,J)=0.0 
XTXINC I ,J)=n.0 



FOR UNDERFLOW WHEN GAMMA 



IS LESS THAN l.OE-15, THEN 1 



2001 



32 

6 



33 

8 



11 



C CALCULATION OF X TRANSPOSE X INVERSE 





DO 4050 


1=1 ,4 




DO 4050 


J=1 ,4 




DO 4050 


K = 1 ,NTOT 


4050 


XTX( I ,J)=XTX( I , J)+X(K,n*X(K,J) 




DO 2002 


1=1,4 




DO 2002 


J = 1 ,4 


2002 


XTXIN( I 


,J)=XTX(I,J) 




DO 4061 


1=1 ,4 


4061 


XTY( I )= 


c 

• 

c 




DO 4062 


>— 1 

It 




DO 4062 


J=1 ,NTOT 


4062 


XTY( I )= 


XTY( I )+X( J, n*Y( J) . 



- o - 

ERIC 



] 



urii 



I 



DO 2003 T=l»4 
. 2 00 ^ ^ ( I ♦ 1 ) r 1 

CALL MATINV{XTXIM»4-.B9l »DET) 

I DO 2020 1=1,4 

2020 THETA( I )=B( I ,1) 
i DO 2004 1=1,4 

2004 SF ( I ) =XTX IN (1,1) 

FNTOT=NTOT 

YTY=0.0 

c calculation of the residual variance 

18 1 = 1 ,NTOT 

18 YTY=YTY-4-Y ( I ) 

DO 31 1=1,4 

31 XTY(T)=0.0 

DO 19 J=l,4 
DO 19 1=1,4 

19 XTY( J)=XTY( J)+THETA( I ) -J^XT X ( I , J ) 

FITVAR=0.0 

DO 20 1=1,4 • 

2 0 FTTVAR = F!TVAR + XTY( I )*THFTA( I ) 

' S=YTY-FITVAR 

S=S/(FNTOT-4.0) 

c calculation of the standard errors of delta and mu 

DO 2010 T=l,4 

SE( I )=SQRT(S^^-SE( I ) ) 

! 2010 T ( I )=THETA{ I ) /SE( I ) 

c calculation of the posterior distribution... logs are used to prevent 
c overflow 

SK=ALOG( S ) 

DET=AL0G(DET) 

H= { ~o 5-«^DET ) - ( . 5"-- { FNTOT-4. 0 ) *SK ) 

H= ,4342945*H 

J'<=JKs-l 

XOUT { J’<,1 )=G 

XOUT( J<,? )=S 

XOUT ( JV' ,3 )=THFTA (3 ) 

XCUT(JK,4)=T(3) 

XOUT ( j:<,5 )=THETA (4 ) 

XOUT( JK ,6 )=T(4) 

XOUT( JK,7)=THFTA(1 ) 

XOUT ( JK ,8 } =T (1 ) 

XOUT ( JK,9 )=THETA (2 ) 

XOUT( JK,I0)=T (2 ) 

I F { N TCT-3 0)1004,1005,1005 
1005 DFLCON( JK,1 ) = THET A ( 4 ) -2 . 58^-SE ( 4 ) 

DFLC0N( JK ,2 ) = THETA ( 4 ) -1 .96-:^SE ( 4 ) 

DELCON( JK ,3 )=THETA (4) 

DELCON { JK ,4 ) =THETA (4 ) -f 1 . RS^i-SE ( 4 ) 

DFLCON( JK,5 ) = THETA ( 4 ) +2 . 5 8^'SE ( 4 ) 

1004 N!T=NlT-rl 
PD(N!T)=H 

! C INCREMENT GAMMA BY .01' AND ITERATE 
G=G-;-l.O00 00 0n0E~0 2 
IF(NIT-199)3O,30,26 

30 GO TO 25 

25 CONTINUE 

C FIND MAXIMUM VALUE OF THE • POSTER I OR 
F!N=PD(1) 

I DO 506 :=2,199 

! I!^(FIN-Pn(I ) )505,506,506 



iERLC 

U — 




505 FIN=PD(I) 

. ^ 506 CONTINUE 

C rescale posterior by dividing all values of the posterior by the iMAX value 

DO 507 1=1,199 
PD( n=PD( I )-FIN 
YY=ARS(PD( I } 1 
IF(YY-35.0) 509,508 ,506 

508 PD ( I) =0.0 
GO TO 507 

509 PD( I ) =PD( I) /.4342945 
PD { I) =EXP(PD( I ) ) 

507 CONTINUE 

C CONVERT AREA OF THE POSTERIOR DISTRIBUTION TO UNIT AREA. BY METHOD OF TRAPEZ 
AREA=0.0 
DO 511 1=2,199 

IT=I~1 

511 AREA=AREA+.005^(PD ( n+PD( 1 1 ) ) 

DO 512 1=1,199 

512 PD(I)=PD(I)/AREA 
DO 513 1=1,199 

513 XOUT( I , 11 )=PD { I ) 

DO 514 1=1,199 

514 WRITE(6,23) (XOUT( I ,J) ,J = 1,11 ) 

23 FORMAT! 1X,1F5.2,10( 1XE10.3) ) 

DO 700 1=1,199 
XX(I,1)=X0UT{ 1,5) 

XX( I ,2)=X0UT( 1,9) 

700 XX! I ,3 )=XOUT{ 1,11 ) 

M=199 
N = 3 

ISCALF=1 

C TITLE FOR PLOT OF THE POSTERIOR DISTRIBUTION AND FOR STUDENT T 
DO 760 1=1,5 
FNAM( 1 , 1 ) =G1 ( I ) 

FNAM(2»I )=G2( I ) 

760 FNAM(3,I )=G3( I ) 

CALL PLOT (M,N, ISCALE,XX,FNAM ) 

M = 5 

C PLOT CONFIDENCE INTERVALS AROUND DELTA 
I F{NTOT-30) 1007,1006,1006 
1006 DO 1008 1=1,199 
DO 1008 J=l,5 
1008 XX( I ,J)=DELCON( I,J ) 

, DO 756 1 = 1,5' 

FNAM( 1 , 1 )=G4( I ) 

FNAM(2 , I )=G5( I) 

FNAM(3, I )=G6( I ) • 

FNAM(4, I )=G7( I ) ’ 

756 FNAM(5,I)=G8( I ) 

V.'RITE(6,1016) 

1016 FORMAT! //34H CONFIDENCE INTERVALS AROUND DELTA) 

WPIT'=^! 6, 1017) 

1017 FORMAT! /3X,5HGAMMA ,2X,52H LOWER 99 LOWER 95 DELTA UPPER 95 
1 UPPER 99) 

00 1019 1=1,199 

1019 WRITE!6,1018)XOUT! 1,1) ,!DELCON! I,J) ,J = 1,5) 

1018 FORMAT(3X,1F5.2,1X5E10.3 ) 

WRITE! 6, 1015) 

1015 FORMAT! //IX, 47H GRAPH OF CONFIDENCE INTERVALS AROUND DELTA HAT) 

CALL PLOT!M,N, ISCALE,XX,FNAM) 
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- 1007 NR=MR-1 

IF(NB) 38,38,39 
I 3 8 CO.NTINUF 
STOP 

pMn 

SUBROUTINE CORREL (N1,N2»Z) 

j DIMENSION 7.(500) ,RLAG(400) , Y { 500 ) , XX ( 2 00 , 1 0 ) ,FNAM(5,5) 

DIMENSION G9(5),G10(5) 

DATA G9/20HPRE TREATMENT DATA / 

DATA G10/20HP0ST TREATMENT DATA / 

NT0T=N1+N2 

C PREPARATION OF TITLE FOR CORRELOGRAM PLOT 
DO 804 JJJ=1,4 
I ND=JJJ 

IF( IND“2 )805,806,807 
807 IF(IND“4)805,806,806 

805 CONTINUE 

DO 757 1=1,5 

I 757 FNAMf 1 ,I ) =G9( I ) 

I GO TO 808 

806 CONTINtjP 

DO 758 1=1,5 
758 FNAMd ,I )=G10( I ) 

I . 808 CONTINUE 

GO TO (809,810,811 ,812) ,IND 

C PREPARATION OF DATA TO CALCULATE AUTOCORRELATIONS FOR PRE TREATMENT DATA 
j 809 NLAG=N1 . 

NLOW=l 
NT0P=N1 
WRITE (6, 8 13) 

813 FORMAT (//38H CORRELOGRAM OF PRE TREATMENT RAW DATA) 

GO TO 314 

C PREPARATION OF DATA TO CALCULATE AUTOCORRELATIONS FOR POST TREATMENT DATA 
810 NLAG=N2 
NL0W=N1+1 
NT0P=NT0T 
WRITE (6,815) 

815 FORMAT! //39H CORRELOGRAM OF POST TREATMENT RAW DATA) 

814 11=0 

I DO 706 I=NLOW,NTOP 

I 1 = 11 + 1 

706 Y( I I ) =Z( I ) 

GO TO 709 

C PREPARATION OF DATA TO CALCULATE AUTOCORRELATIONS FOR PRE TREATMENT 
1 C DIFFERENCES BETWEEN SUCCESSIVE OBSERVATIONS 

j 811 NLAG=N1-1 

;1 NL0W=1 

NT0P=N1-1 
WRITE (6,800) 

i 800 FORMAT (//41H CORRELOGRAM OF PRE TREATMENT DIFFERENCES) 

j: GO TO 816 

i: C PREPARATION OF DATA TO CALCULATE AUTOCORRELATIONS FOR POST TREATMENT 

! C DIFFERENCES BETWEEN SUCCESSIVE OBSERVATIONS 

I 812 NLAG=N2“1 

I: NL0W = N1 + 1 

ij NTOP=NTOT-l 

: WRITE (6, 803) 

J 803 FORMAT(//42H CORRELOGRAM OF POST TREATMENT DIFFERENCES) 

I 816 11=0 

DO 707 I=NLOW,NTOP 

j 

' t ■■ O ' ' 

j EMC 






TI=IT+1 
‘ IK=I+1 

707 Y( !I )=Z{ IK)~Z(I ) 

709 JJ = NLAG-x-3/4 

DO 716 ;< = 1#JJ 
NUP=NLAG~K 
CROS=0.0 
SUM1=0*0 
SUM2--n*0 
AU'^-.S1=0.0 
SUMS 2 =0.0 
DO 703 I=1>NUP 
NI ND=I+K 

CROS=CROS+Y( I )*Y(NIND) 

SUMi = SUMl-f-Y( I ) 

SUM2=SUM2+Y{NIND) 

SUMSl = SUMSi+Y ( I )->«-Y ( I ) 

703 SUMS2 = SUMS2+Y (NIND)-x-Y(NIND) 

FNUP=NUP 

DNUM=CR0S-(SUM1-«-SUM2 ) /FUUP 

DEN=(SUMSl~(SUiVil*SUMl ) /FNUP ) * ( SUMS2- ( SUM2’"\SUM2 ) /FNUP) 

DEN=SQRT (DFN) 

RLAG(K)=DNUM/DEN 
WRITF(6,704)K »RLAG (K) 

704 F0RMAT(1X»5H LAG=I3,3X»3H R=F6.3) 

716 CONTINUE 

ISCALE=1 
N = 1 
M=JJ 

DO 650 I=1»JJ 
650 XX( I 9 1 } =RLAG (I ) 

CALL PLOT (M»N» ISCALE,XX,FNAM) 

804 CONTINUE 
RETURN 
END 

SUBROUTINE PLOT ( M ,N » ISCALE » XX » FNAM ) 

C ADAPTED FROM THE SUBROUTINE-GRAPH- OF -PERSUB- WRITTEN BY J.H.WARD? J-. » 
C KATHLEEN DAVIS, AND' JANICE BUCHHORN , LACKLAND AIR FORCE BASE, TEXAS 
C ADAPTED FOR FORTRAN II BY T.O.MAGUI RE ,UN I VERS ITY OF ILLINOIS 

DIMENSION RAT (10) ,FND(5,10) ,ENCRMT( 10) ,PA(120) ,FMT(3) ,PB(10) 
DIMENSION XX (200, 10) ,AMAX(10) ,AMI N ( 1 0 ) , B ( 1 0 ) ,FNAM(5,5) ,B3(10) 

DATA BB( 1 ) ,BB (2 ) ,BR{ 3 ) ,BB (4) ,BB (5 ) ,BB(6 ) ,BB(7 ) ,BB( 8) ,BB( 9) ,BB ( 10)/ 

1 IHl, 1H5 ,1HD,1H5, IHl, 1H0,1HX,1HM,1HH,1HA/ 

DATA PCROSS,PBLANK ,PDASH,PERI0D/1H^" , IH ,1H-,1H./ 

DATA FMT(l) ,FMT(2) ,FMT(3)/1H. ,1H0,1H-/ 

IF (N-5 ) 200,201,200 
201 DO 276 1=1,5 

276 R ( I )=BR( I ) 

GO' TO 202 

200 DO 277 1=1,3 
11=1+5 

277 B ( I )=BB( II) 

202 CONTINUE 

99 FMS=FMT( ISCAlE-1 ) 

98 DO 100 I=1,M 
AMIN ( I )=+l.E+37 
100 AMAX ( I )=-l.E+37 
C SEARCH FOR MAXIMA AND MINIMA 
ABMIN = AM IN ( 1 ) 

ARMAX=AMAX ( 1 ) 










■ J* 
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DO 101 1=1 »M 
no 101 J=i»N 
C0MP=XX{ I 9 J)~AMIN( J ) 

! F (COMP )A002 ,400394003 

4002 AMIN( J)=XX( I, J) 

4003 C0MP=XX( I 9J)-AMAX { J) 

IF (COMP ) 101 9101 ,4004 

4004 AMAXC J ) =XX( I 9 J) 

101 CONTINUE 

IF(M-5)250,25l9250 

251 DO 252 J=l95 
AMAX( J)=AMAX{5) 

252 AMIN( J)=AMIN( 1 ) 

250 CONTINUE 

N1=N 

DO 108 J=1,N 

C COMPUTE RESOLUTION OF GRAPH 

108 RAT( J)=( AMAX( J)-AMIN( J) )/ll0* 

DO 110 J=1,N 

ENCRMT( J)=(AMAX( J)-AMIN( J) )/4. 

FND( 1,J)=AMIN( J)-^.05 
C0MP=AMIN ( J) 

IF (COMP) 4005 9 4006, 4006 

4005 FND( l9J)=FND( 19J)~.10 

4006 FND( 59 J)=AMAX( J)-.05 
COMP=AMAX(J) 

IF (COMP) 4007,4008 9 400 8 
C PREPARE ordinate LABELS 

4007 FND(59 J)=FND( 59 J )-#10 

4008 FND( 29 J) =AMI N ( J ) +ENCRMT ( J ) +• 05 
C0MP=FND( 29 J ) 

IF (COMP) 4009, 40 10 9 40 10 

4009 FND( 29 J) =FND( 29 J )-. 10 

4010 FND( 3 9 J) = AMIN ( J ) + ( ENCRMT ( J) ^-2. ) + .05 
COMP=FND{ 3, J) 

IF (COMP) 4011 94012,4012 

4011 FND( 3 9 J) =FND( 3, J ) ~. 10 

4012 FND(4, J) =AMAX (J) “ENCRMT (J)+. 05 
COMP=FND{ 4, J ) 

IF (COMP )4013, 110, 110 

4013 FND(4, J)=FND(4,J)-.10 
110 CONTINUE 

C PRINT LEFT HAND LABELS 

WRITE{6,7) ( (FND( I, J) 9 1=1,5 ) ,B( J) 9J=1 9 NI ) 

7 FORMAT(///lXF6.l921XF6.l921XF6.l921XF6.l921XF6.l93XAl) 
C PLOT left hand MARGIN 

DO 4014 1=1 ,59 

4014 PA(I)=PDA5H 
WRITE(6,8) (PA( I ) 91=1,59) 

8 FORMAT( 1X,59A2 ) 

DO 140 1=1 ,M 

DO 4015 I 1 = 1,120 

4015 PA ( I I ) =PP.LANK 

DO 121 IX=27,81,27 
121 PA( IX)=PERI0D 
MCOM^= I 

4016 NCOMP=NCOMP-10 

IF (NCOMP ) 124,4017 ,4016 

4017 CONTINUE 

DO 123 IX=69l20,2 



• 123 PA ( I X) =PDASH 

c rescalf: data points 

124 DO 135 K=1,M 

2L= ( XX ( I , K) -AN: IN (K ) ) /RAT ( K) + 1 . 0 
L = ZL 

IF{ L~1 )6018 ,6019 96019 

6018 L=1 

6019 IF{ 110--L)6020, 6021 ,6021 

6020 L=110 

6021 I F ( PA ( L ) • EQ* PBLANK ) GO TO 130 
IF(PA(L) .E0.PERI0D)G0 TO 130 
I F ( PA ( L) o EQ* PDASH ) GO TO 130 
PA(L)=PCROSS 

GO TO 135 
130 PP(K)-R(K) 

PA { L ) =PB ( .< ) 

135 rONTTNU'=^ 

IF( ISCALF-1 )6023,136,6o23 
6 FORMAT ( A I ) 

6023 WRITE(6,6 )FMS 
C PLOT DATA POINTS 

136 WRITE ( 6 ,2 ) I ,PER lOD , ( PA ( J ) , J=l , 1 10 ) , PERIOD , I 
2 format { IX , 13 ,A1 ,11 OAl , A1 , 13 ) 

140 CONTINijF 

DO 6026 1=1,59 
6026 PA(I)=PDASH 



142 

3 



122 

4 



C 



C 

10 

15 

20 

30 

C 

40 
45 
50 
60 
70 
80 
65 
90 
95 
1 00 
1 05 
110 
C 

130 



WRITE (6,8 ) (PA( I ) ,1=1,59) 

WRITE(6,7) ( (FND( I , J) ,1 = 1,5) ,B(J) ,J=1,NM 

WRITE (6, 3) 

FORMAT! 1H0,10X,1 6H PLOT DESCR I PT I ON/ 1 HO , 7X , 5HT I TLE , lOX , 9HCHA RAC TER 
14X ,7HMI N IMUM , 4X ,7HMAX IMUM ,4X , 1 OHRE SOL UT I ON ) 

DO 122 J=1 ,N 

WRITE (6,4 ) ( FNAM( J, K) , K= 1 , 5 ) , B ( J ) , AM I N ( J ) ,AMAX ( J ) ,RAT( J) 

FORMAT ( iX,5A4,2X,Al,7XF6.3,3XF8.3,4XF8.3) 

RETURN 

END 



SUBROUTINE MAT I NV ( A ,N , B ,M , DETERM ) 

MATRIX INVERSION WITH ACCOMPANYING SOLUTION OF LINEAR EQUATIONS 
DIMENSION A(4,4) ,3(4,1) ,IPIV0T(4) ,INDEX(4,2) ,PIV0T(4) 

EvOU I VALENCE ( IROW, JROW) , ( I COLUM , JCOLUM ) , ( AMAX , T , SW AP ) 

INITIALIZATION 

DETERM=1.0 

DO 2 0 sJ=l ,N 

IPTV0T(J)=0 

DO 550 1 = ] ,N 

SEARCH FOR PIVOT ELEMENT 
AM A X= 0.0 
DO lo5 J=1,N 

IF ( IPIV0T( J)~l ) 60,105,60 
DO 100 K=1,M 

IF ( IPIVOT(K)-! ) 80,100,740 

IF (ABS (AMAX)-ABS (A(J,K))/ 85, 85, 100 

IROW=J 

iCOLUM=K 

AMAX=A ( j ,K) 

^ONTINO- 



GO'-' T T,’' I »r 

I P I VOT { I COL UM ) = I P I VO T ( I COLUM ) + 1 
INTFRCgANG- rows TO PUT PIVOT ELEMENT ON 
IF ^ROW-ICOLUM) 140,260,140 



DIAGONAL 





















140 DETERM=~DETt:RM 
*» 150 DO 200 L=1,N 
160 5WAP=A( IROW»L) 

170 A ( IROW,L) =A { ICOLUM »L) 

200 A f ICOLUM,L)=SWAP 
2 05 IF(M) 260 9 2 60, 210 
210 DO 250 L=1,M 
22 0 SWAP = B ( IROKSL ) 

230 B { IROW,L)=B ( ICOLUM ,L) 

250 R ( ICOLUM,L)=SWAP 

260 INDEX { r ,1 )=IROW 

270 INDEX( I ,2 )=ICOLUN 

310 PIVOK I )=A( ICOLUN, ICOLUM) 

320 DETERM = DETERM-)«-P!VOT{ I ) 

IF(PIVOT(D) 330,720,330 
C DIVIDE PIVOT ROW BY PIVOT ELEMENT 

330 A ( ICOLUM, IC0LUM)=1 .0 
340 DO 350 L=1,N 

350 A ( ICOLUM, L) =A ( ICOLUM , L ) /P I VOT ( I ) 

355 IF(M) 380,380,360 
360 DO 370 L=1,M 

370 B( ICOLUM, L)=B { I COLUM , L ) /P I VOT ( I ) 

C REDUCE NON-PIVOT ROWS 

380 DO 550 Ll = l ,N 
390 IF(Ll-ICOLUM) 400,550,400 
400 T=A(L1 *ICOLUM) 

420 A (Ll , ICOLUM) =0.0 . 

430 DO 450 L=1,N 

450 A(L1,L)=A(L1,L)-A( ICOLUM, L ) ^"T 

455 IF(M) 550,550,460 
460 DO 500 L=1 ,M 

5 00 B(L1,L)=B(L1,L)-B{ I COLUM , L ) -"'T 

550 CONTINUE 
C INTERCHANGE COLUMNS 

600 DO 710 1=1 ,N 
610 L=N+l-I 

•620 IF(INDEX(L,1)-INDEX(L,2) ) 630,710,630 
630 JROW=INDEX ( L, 1 ) 

640 JC0LUM=INDEX { L,2 ) 

650 DO 705 K=1 ,N 
660 SWAP=A( K, JROW ) 

670 A(K,JROW)=A(K ,JCOLUM) 

700 A (K,JCOLUM)=SWAP 
705 CONTINUE 
710 CONTINI.'E 
RETURN 

720 WRITE ?6 ,730 ) 

730 format (20H MATRIX IS SINGULAR) 

740 RETURN 
END 

IF (COMP) 4005,4006,4006 




^ C TIME SERIES WITH CONSTANT DRIFT 

j C parameter card one cols. 192 NB=NUMBER OF PROBLEMS TO BE RuN 
C CARDS 2»394 ARE REQUIRED FOR EACH PROBLEM TO BE RUN 
» C CARD 2 COLS. 2-80 TITLE OF PROBLEM 
C CARD 3 COLS l -80 FORMAT FOR DATA CARDS.. IN THE FORM (FORMAT) 

C CARD 4 COLS. 1-4 NUMBER OF PRE TREATMENT MEASURES. COLS. 5-8 NUMBER OF 

C CARO 5 ETC. DATA CARDS FOR FIRST PROBLEM NO SEPERATION BETWEEN PRE AND POST 
C TREATMENT DATA 

DIMENSION Z( 500) 9Y (500) »X( 500»3 } >XTXIN( 3»3 ) 9XTX(3»3) »XTY( 3) 

DIMENSION THETA(3) 9FMT( IS ) »TITLE{ 18 ) 9PD(200) »XOUT(201,10) 

DIMENSION XX (200 9 10) 9 DELCON ( 200 9 5 ) 9FNAM(595) 

DIMENSION 61 ( 5 ) 9 G 2 (5)9 G 3 (5)9 G4(5)»G6{5)9 07(5)968(5)969 (5)905(5) 

DATA 61/20HT. .CHANGE IN LEVEL / 

DATA G2/20HSCALED POSTERIOR / 

DATA G 3 / 20 HLOWER 99 PERCENT / 

DATA G4/20HLOWER 95 PERCENT / 

DATA G5/20HDFLTA / 

DATA G 6 / 20 HUPPER 95 PERCENT / 

DATA G7/20HUPPER 99 PERCENT / 

XXCK=1.0E-15 
NBB = 0 

READ(5936)NB 

36 FORMAT! 12) 

39 READ (591) (TITLE! I ) 9 1 = 1 9 18) 

1 FORMAT! 18A4) 

JK=0 

NIT=0 

G=0.01 

NBB=NBB+1 

WRITE! 6937 ) NRR 

37 F0RMAT(//9H PROBLEM 12//) 

WRITE(69l) (TITLE! I ) 9l=l9l8) 

READ(59l) (FMT ( I ) 9 l=l 9 l 8 ) 

READ(593)Nl9N29NCC 

3 FORMAT (2 149 II ) 

NT0T=N1+N2 

REA0(59FMT) (Z( I ) 9l=l9NT0T) 

WRITEC69601 )Nl9N2 

601 FORMAT! //8X94H N1=I394H N2=I3//) 

WRITE(69602 ) 

602 FORMAT (15H INPUT DATA) 

WRITE (69600) (Z(I)9l=l9Nl) 

600 F0RMAT(5(1XE13.5) ) 

NNN=N1+1 

WRITE(69603) (Z( I ) 9 I=NNN9NT0T) 

603 FORMAT! //5 (1XE13.5) ) 

IF(NCC)702970l9702 

702 CALL CORREL {NI 9 N 29 Z) 

701 CONTINUE 

WR:TE(69666) 

666 FORMAT (IHO) ■ 

WRITE(6922) 

22 FORMAT ( IX 9 II 6 H GAMMA RESIDUAL VAR L HAT MU HAT STD ERR 

1 MU T FOR MU DELTA STD ERR DELTA T FOR DELTA POST DIST GAMMA) 

25 CONTINUE 

C CALCULATION OF Y SCORES FROM THE DATA ..CHECK FOR UNDERFLOW WHEN GAMMA 
C IS loO AND Y(N) IS NEARLY EQUAL TO Y(N-l) 

Y ( 1 ) =Z { 1 ) 

DO 5 I=29NT0T 



ERIC 
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yy=abs(y( in ) 

IF(YY-XXCK)42,42*45 
45 IF( YY-.OOnOOl )40*40*41 

40 GG=ABS(1«0-G) 

IF(GG-*001 )42*42*41 

42 Y(I)=2(I)-Z(II) 

GO TO 5 

41 Y(I)=(2( I )-Z(II) )+(1.0-G)*Y( II) 

5 CONTINUE 

C CALCULATION OF WEIGHTS* IF ABSOLUTE VALUE OF X IS LESS THAN 1*0E-15* THEN 
C IS SET EQUAL TO ZERO TO PREVENT UNDERFLOW 
DO 1000 I=1*NT0T 

1000 X(I*1)=1*0 
X(l*2)=1.0 
X(2*2)=1.0-G 
DO 6 I=3*NT0T 
11=1-1 

X( I*2)=X(2*2)*X(II *2) 

XXX=ABS(X( I *2 ) ) 

IF(XXCK-XXX)6*6*32 

32 X(I*2)=0.0 

6 CONTINUE 

DO 7 I=1*N1 

7 X(I*3)=0.0 
NN=N1+1 

DO 8 I=NN*NTOT 
II=I-N1 

xn*3)=X( 11*2) 

XXX=ABS(X( I *3 ) ) 

IF(XXCK-XXX)8 *8*33 

33 X(I*3)=0.0 

8 CONTINUE 

C CALCULATION OF X TRANSPOSE X INVERSE 
DO 11 1=1*3 
DO 11 J=l*3 
XTX(I*J)=0*0 
11 XTXIN( I *J)=0«0 
FNTOT=NTOT 
XTX( 1 *1)=FNT0T 
DO 1001 I=1*NT0T 

XTX( 1*2)=XTX(1*2)+X( I*1)*X( 1*2) 

XTX( 1*3)=XTX(1 *3)+X( I *1)*X( 1*3) 

XTX( 2*2)=XTX(2*2)+X( I *2)**2 
XTX(3*3)=XTX(3*3)+X( I*3)**2 

1001 XTX(2*3)=XTX(2*3)+X( I *2)*X( 1*3) 

DO 1002 1=1*2 

DO 1002 J=l*3 

1002 XTX( J*I )=XTX( I * J) 

DET=XTX(1*1)*(XTX(2*2)*XTX(3*3)~XTX(2*3)**2) 

DET=DET-XTX(2 *1)*(XTX( 1*2)*XTX(.3*3)-XTX(3*2)*XTX(1*3) ) 

DET=DET+XTX(3 *1)*(XTX( 1 ,2 ) *XTX ( 2 * 3 ) -XTX ( 2 >2)*XTX( 1 *3) ) 

XTXIN( 1*1 ) = (XTX(2* 2)*XTX(3*3 )-XTX( 2 *3 )**2 ) /DET 
XTXIN{2*2) = (XTX(1>1)*XTX(3>3)-XTX( 1 *3 ) -«'*2 ) /DET 
XTXIN(3*3)=(XTX(1*1)*XTX(2*2)~XTX( 1*2)**2 )/DET 
XTXIN(1>2)=-(XTX(2>1)*XTX(3*3)-XTX(3*1)*XTX(2*3) )/DET 
XTXIN( 1*3)=(XTX(2>1)*XTX{3>2)-XTX(3»1)*XTX(2*2 ) )/DET 
XTXIN(2*3)=-(XTX(1*1)*XTX(3*2)-XTX(3*1 )*XTX( 1*2 ) )/DET 
DO 1003 1=1*2 
DO 1003 J=l*3 






1003 XTXIN( J^n=XTXIN(I»J) 

DO 13 I-l -»3 
THETA ( I '/ = 0.0 

13 XTY(I)=0.0 
DO 14 J=1 93 

DO 14 1 = 1 9.MT0T 

14 XTY( J5=XTY( J)-}-X( I 9 I ) 

C CALCULATION OF THETA... L HAT AND DELTA HAT 
DO 15 J=l93 
DO 15 1=193 

15 THETA( J)=THETA( J)+XTXIN(J»n*«-XTY{ I ) 



YTY=0.0 

C calculation of the RESIDUAL VARIANCE 
DO 18 I=i9NT0T 
18 YTY=YTY-!-Y(I)^--^2 

DO 31 I=l»3 



31 XTY{n=0c0 
DO 19 J=1.3 
DO 19 1=1.3 

19 XTY( J)=XTY( J)+THETA( I )^XTX( I»J) 



FITVAR=0.0 
DO 20 1=193 

20 FITVAR=FITVAR+XTY( I )*THETA( I ) 

S=YTY-FITVAR 

S=5/(FNTOT-3.0) 

C CALCULATION OF THE STANDARD ERRORS OF DELTA AND MU 



SMU=S*XTXIN(1 9l ) 

SDELTA=S^"XTXIN(3»3 ) 

SMU=SQRT(SMU) 

SDELTA=SQRT ( SPELT A ) 

TMU=THETA(1) /SMU 
TDELTA=THETA( 3) /SDELTA 

C CALCULATION OF THE POSTERIOR DISTRIBUTION... 

C OVERFLOW 

SK=AL0G{ S) 

DET=ALOG( DET) 

H= ( -. 5---0ET ) - ( . 5* { FNTOT-3 . 0 ) *SK ) 

H=.4342945*H 

JK=JK+1 

XOUT( JK.l )=G 

XOUT( JX.2 )=S 

XOUT( J:<»3 )=THETA(2 ) 

XOUT< JK.4 )=THETA< 1) 

XOUT( JK.5 )=SMU 
XOUT( J:<.6 )=TMU 
X0UT(JK.7)=THETA(3) 

XOUT( JK^S )=SDELTA 
XOUT( JK»9 )=TDELTA 
IF<NTOT-30)10049l005.1005 
1005 DELC0N( JK 9 I )=THETA(3 )-2.58^SDELTA 
DELCON (JK92)=THETA(3) -1 .96*SDELTA 
DELCON ( JK .3 ) = THETA (3) 

DELCON ( JK 94)=THETA (3 ) + 1.96-5^SDELTA 
DFLCON( JK»5)=THETA (3)+2.58"'«-SDELTA 



logs ARE USED TO PREVENT 



1004 NIT=NIT-M 
PD(NIT)=H 

C INCREMENT GAMMA BY .01 AND ITERATE 

G=G+1.000000n0E~02 
IF(NIT“199)30»30»26 
30 GO TO 25 



ERIC 



30 



OF THE POSTERIOR 



505 

506 



ALL VALUES OF THE POSTERIOR BY THE MAX VALUE 



^08 



509 



5 07 



511 



512 



513 



514 

23 



26 CONTINUE 
C FIND MAXIMUM VALUE 
FIN=PD( 1 ) 

DO 506 I^2»199 
IF (FI N-PD (I))505*506»506 
FIN=PD( I) 

__ CONTINUE 

RESCALE POSTERIOR BY DIVIDING 
DO 507 I=l»199 
PD( I ) =PD( I )-FIN 
YY=ARS(PD( I ) ) 

I F(YY-35« 0)509*508 »508 
PD( I )=0.0 
GO TO 507 

PD( I )=PD( I )/. 4342945 
PD(I )=EXP(PD( I ) ) 

CONTINUE ^ 

CONVERT AREA OF THE POSTERIOR DISTRIBUTION 

AREA=0.0 
DO 511 1=2*199 
11=1-1 

AREA=AREA+«005*(PD( n+PD( II ) ) 

DO 512 1=1*199 
PD ( I ) =PD( I ) /AREA 
DO 513 1=1*199 
XOUT( I *10)=PD( I ) 

DO 514 1=1*199 

WRIT£(6>23) (XOUT( I *J) *J=1 »10) • 
F0RMAT(3X*1F5«2*4X*9E11.3 ) 

DO 700 1=1*199 
XX(I *l)=XOUT( 1*9) 

XX(I *2)=XOUT( 1*10) 

M = 199 
N=2 

TITLe'^FOR^ plot of the POSTERIOR DISTRIBUTION 

DO 760 1=1*5 
FNAM(1*I )=G1 ( I ) 

FNAM(2*I )=G2( I ) 

CALL PLOT (M*N* ISCALE*XX*FNAM ) 

N ^ 5 

C PLOT CONFIDENCE INTERVALS AROUND DELTA 
I F(NTOT-30) 1007 *1006 *1006 
1006 DO lOOS 1=1*200 
DO 1008 J=l*5 
1008 XX( I *J)=DELC0N( I* J) 

DO 756 1=1*5 
FNAMd *I )=G3( I ) 

FNAM(2*I )=G4( I ) 

FNAM(3*I )=G5( I ) 

FNAM(4»I )=G6( I ) 

756 FNAM (5*1) =G7 ( I ) 

1016 FORMAT(//34H CONFIDENCE INTERVALS AROUND DELTA) 

1017 FOt7MAT°/3X^.5HGAMMA.2X»52H LOWER 99 LOWER 95 DELTA 
1 UPPER 99) 

DO 1019 I=l»199 

1019 WRITE<6,1018)X0UT( I.l) »<DELCON( I.J) .J=l»5) 

1018 F0RMAT(3X*1F5.2*1X5E11.3 ) 



TO UNIT AREA BY METHOD OF TRAPEZO 



700 



AND FOR STUDENT T 



760 



UPPER 95 



ERIC 






WRI TE( 6, 101 5) 

. 1015 FORMAT { //IX 9/f7H GRAPH OF CONFIDENCE INTERVALS AROUND DELTA HAT) 
CALL PL0T(M,N,!SCALE,XX,FNAM) 

. 1007 NB=Nn-l 

IF(NB)38*38»39 
38 CONTINUE 
CALL EXIT 
END 

SUBROUTINE CORREL (Nl^NZ^Z) 

DIMENSION 2(500 ) 9 RLAG ( AOO ) « Y ( 500 ) » XX ( 200 » 1 0 ) *FNAM{5*5 ) 

DIMENSION G8(5) 9G9(5) 

DATA GS/20HPRE TREATMENT DATA / 

DATA G9/20HPOST TREATMENT DATA / 

NT0T=N1+N2 

C PREPARATION OF TITLE FOR CORRELOGRAM PLOT 
DO 804 JJJ=1»4 
IND=JJJ 

IF(IND~2)S059S069807 
807 IFaND-4)80598065306 



805 


CON 


TINUE 






DO 


757 


1 = 


195 


757 


FNA 


M(1 


,I ) 


=G8(I ) 




GO 


TO 


808 




806 


CONTINUE 






DO 


758 


1 = 


1,5 


758 


FNAMi' 1 


9l ) 


=G9( I) 



808 CONTINUE 

GO TO {809,81098119812) »IND 

C PREPARATION OF DATA TO CALCULATE AUTOCORRELATIONS FOR PRE TREATMENT DATA 

809 NLAG=N1 
NLOW=l 
NT0P=N1 

VvR ITECosSlS) 

813 FORMAT(//38H CORRELOGRAM OF PRE TREATMENT RAW DATA) 

GO TO 814. 

C PREPARATION OF DATA TO CALCULATE AUTOCORRELATIONS FOR POST TREATMENT DATA 

810 NLAG=N2 
NL0W=N1-M 
NTOP=NTOT 
WRITE(6,815) 

815 FORMAT(//39H CORRELOGRAM OF POST TREATMENT RAW DATA) 

814 11=0 

DO 706 I=NLOW,NTOP 
11 = 11+1 

706 Ynn=Z( I) 

GO TO 709 

C PREPARATION OF DATA TO CALCULATE AUTOCORRELATIONS FOR PRE TREATMENT 
C DIFFERENCES BETWEEN SUCCESSIVE OBSERVATIONS 

811 NLAG=N1-1 
NLOW=l 
NT0P=N1-1 
WRITE{6v800) ■ 

800 F0RMAT{//4lH CORRELOGRAM OF PRE TREATMENT DIFFERENCES) 

GO TO 816 

C PREPARATION OF DATA TO CALCULATE AUTOCORRELATIONS FOR POST TREATMENT 
C DIFFERENCES BETWEEN SUCCESSIVE OBSERVATIONS 

812 NLAG=N2-1 
NLCW=N1+1 
NTOP=NTOT-l 
WRITE(6v803) 




803 FORMAT (//42H CORRELOGRAM OF POST TREATMENT DIFFERENCES) 

816 11=0 

DO 707 I=NLOW*NTOP 

ri=ii+i 

IK=I+1 

707 Y(II)=2(IK)-2(I) 

709 JJ=NLAG*3/4 
DO 716 K=1*JJ 
NUP=NLAG~K 
CROS=0.0 
SUM1=0.0 
SUM2=0«0 
SUMS1=0.0 
SUMS2=0.0 
DO 703 I=1*NUP 
NIND=I+K 

CROS=CROS+Y ( I ) *Y ( N I ND ) 

SUM1=SUM1+Y( I ) 

SUM2=SUM2+Y(NIND) 

SUMS1=SUMS1+Y ( I ) *Y (I ) 

703 SUMS2=SUMS2+Y ( N I ND ) *Y ( N I ND ) 

FNUP=NUP 

DNUM=CROS-(SUMl*SUM2 ) /FNUP 

DEN=(SUMS1-(SUM1*SUM1) /FNUP ) * ( SUMS2- ( SUM2*SUM2 ) /FNUP) 

DEN=SQRT(DEN) 

RLAG(K)=DNUM/DEN 

WRITE(6»704)K»RLAG(K) 

704 F0RMAT(1X*5H LAG=I3*3X»3H R=F6.3) 

716 CONTINUE 

iscale=i 

N = 1 
M=JJ 

DO 650 I=1*JJ 
650 XX( I *1) =RLAG{ I ) 

CALL PLOT(M*N*'"SCALE*XX*FNAM) 

804 CONTINUE 
RETURN 
END 

SUBROUTINE PLOT (M *N * ISCALE»XX *FNAM ) 

C ADAPTED FROM THE SUBROUTINE-GRAPH- OF -PERSUB- WRITTEN BY J.H.WARD*JR. 
C KATHLEEN DAVIS* AND JANICE BUCHHORN *LACKLAND AIR FORCE BASE*TEXAS 
C ADAPTED FOR FORTRAN II BY T.O.MAGUI RE *UN IVERS ITY OF ILLINOIS 

DIMENSION RAT (10) *FND(5*10) *ENCRMT ( 1 0 ) * PA ( 12 0 ) *FMT(3) *PB"( 10) 
DIMENSION XX (200 *10) *AMAX(10) *AMIN(10)*B(10) *FNAM(5*5) *BB(10) 

DATA BB( 1) *BB (2) *BB( 3 ) *BB(4) *BB (5 ) *BB(6 ) *BB(7) *BB(8) * BB ( 9 ) * BB ( 10 ) / 
11H1*1H5*1HD*1H5*1H1*1H0*1HX*1HM*1HH*1HA/ 

DATA PCR0SS*PBLANK*PDASH*PERI0D/1H**1H *1H-*1H./ 

DATA FMT(l) *FMT(2) *FMT(3)/1H *1H0*1H-/ 

IF(N-5)200*201*200 

201 DO 276 1=1*5 

276 B(I)=BB(I) 

GO TO 202 

200 DO 277 1=1*3 
11=1+5 

277 B(I)=BB(II) 

202 CONTINUE 
99 FMS=FMT( ISCALE-1) 

98 DO 100 I=1*N 

AM I N ( I ) =+l • E+37 
100 AMAX( I )=-l.E+37 






C SEARCH FOR MAXIMA AND MINIMA 
APMI N=AM IN ( 1 ) 

ABMAX=AMAX( 1 ) 

DO 101 I=1»M 
DO 101 J----1»N 
COMP=XX{ I 5J)~AMIN( J) 

IF(C0MP)A002 9^003 >4003 

4002 AMIN( J)=XX{ I>J) 

4003 COMP=XX( I >J)-AMAX( J) 

IF CCOMP) 101 >101 >4004 

4004 AMAX( J)=XX( I> J) 

101 CONTINUE 

IF (N-5 )250>251>250 

251 DO 252 J=l>5 
AMAX( J)=AMAX{5) 

252 AMIN( J)=AMIN{ 1 ) 

250 CONTINUE 

N1=N 

DO 108 J=1>N 

C COMPUTE RESOLUTION OF GRAPH 

108 RAT( J)=(AMAX( J)“AMIN( J) ) /llO. 

DO 110 J=1>N 

ENCRMT< J)=(AMAX( J) “AMIN( J) )/4o 
FND( 1>J)=AMIN( J)+*05 
C0MP=AMIN{J) 

IF {COMP ) 4005 9 4006 >4006 

4005 FND( 1> J)=FND( J)-«10 

4006 FND(5> J)=AMAX( J)-*05 
COMP=AMAX( J) 

IF (COMP ) 4007 >4008 >4008 
C PREPARE ORDINATE LABELS 

4007 FND(5>J)=FND( 5>J)-.10 

4008 FND( 2>J)=AMIN( J)+ENCRMT( J)+»05 
COMP=FND( 2> J) 

IF(COMP)4009,4010>4010 . 

4009 FND( 2>J)--FND{ 2> J)-*10 

4010 FND( 3 > J) =AMIN ( J ) + ( ENCRMT ( J)^*2# ) + #0 5 
C0MP=FND( 3i> J) 

IF (COMP ) 4011 >4012 >4012 

4011 FND(3>J)=FN0(3>J)“«10 

4012 FND(4>J) =AMAX ( J ) -ENCRMT ( J )+• 05 
COMP=FNO{ 4, J) 

IF (COMP) 4013 >110 >110 

4013 FND(4« J)=FND(4>J)-.10 
110 CONTINUE 

C PRINT LEFT HAND LABELS 

V/RITE(6>7)((FND(I>J)>I = 1»5)>B(J)>J=1>N1) 

7 FORMAT ( ///1XF6.1 >21XF6* 1> 21XF6> 1 >21XF6 .1 >21XF6* 1 >3XA1 ) 

C PLOT LEFT HAND MARGIN 
DO 4014 I=l>59 

4014 PA ( I) =PDASH 
V/RITE(6>8) (PA( I ) >I=1>59) 

8 FORMAT( 1X>59A2) 

DO 140 1=1 >M 

DO 4015 II=1>120 

4015 PA(II)=PBLAN{< 

DO 121 IX=27>81>27 
121 PA ( I X) =PERIOD, 

NCOMP=I 

4016 NCOMP=NCOMP-10 



o 



I 



^ IF(NCCMP)i24>4017»4016 

• 4017 COiNTI.NUE 

DO 123 IX=6»120»2 

* 123 PA(IX)=PDA5H 

c rescale data POIMTS 
124 DO 135 K=1»N 

ZL=(XX( I9K)~AMIN{K ) ) /RAT (K) +1.0 
L = 2L 

!F(L-1 )60 18 >6019 >6019 

6018 L=1 

6019 IF( 110-L)6020>6021 >6021 

6020 L=110 

6021 IF( PA(U -PBLANK )6022>130>6022 
6022:. IF (PA (L) -PERIOD ) 60 24, 13 0> 6024 

6024 IF (PA{L)-PDASH)6025>130»6025 

6025 PA(L)=PCROSS 
GO TO 135 

130 PB(K)=B(K) 

PA{L)=P3(K) 

135 CONTIMUE 

IF( I SCALE-1) 6023 > 136 >60 23 
6 FORMAT (AD 
6023 WRITE(6>6)FMS 
C PLOT DATA POINTS 

136 WRITE(6>2 )I >PERIOD> (PA( J) > J=l>110 ) >PERIOD> I 

2 FORMAT( 1X> I3,A1>110A1 >A1> 13) 

140 CONTINUE 

DO 6026 I=l>59 

6026 PA(I)=PDASH 

WRITE! 6,8 ) (PA C I ) > I =1>59) 

WRITE(6>7)( (FND( I>J)>I=1>5)>B(J)>J=1>N1) 

142 WRITE (6 c. 3) 

3 FORMAT ( 1HO>10X>16HPLOT DESCRIPT ION/ IHO >7X > 5HT I TLE > lOX >9HCHARACTER > 
14X>7HMINIMUM,4X>7HMAXIMUM>4X>10HRESOLUTION) 

DO 122 J=1>N 

122 WRITE ( 6 >4 ) (FNAM{ J> K) >K=1>5 ) >B ( J ) >AM INI J) >AMAX ( J ) >RAT ( J) 

4 FORMAT ( 1X>5A4 >2X>A1 >7XF8.3 >3XF8 .3 >4XF8 .3 ) 

RETURN 

END 



o 



